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Preface 


Research in cosmology has become extraordinarily lively in the past quarter 
century. In the early 1980s the proposal of the theory of inflation offered a 
solution to some outstanding cosmological puzzles and provided a 
mechanism for the origin of large-scale structure, which could be tested by 
observations of anisotropies in the cosmic microwave background. 
November 1989 saw the launch of the Cosmic Background Explorer 
Satellite. Measurements with its spectrophotometer soon established the 
thermal nature of the cosmic microwave background and determined its 
temperature to three decimal places, a precision unprecedented in cosmol- 
ogy. A little later the long-sought microwave background anisotropies were 
found in data taken by the satellite’s radiometer. Subsequent observa- 
tions by ground-based and balloon-borne instruments and eventually by 
the Wilkinson Microwave Anisotropy Probe showed that these anisotropies 
are pretty much what would be expected on the basis of inflationary theory. 
In the late 1990s the use of Type Ia supernovae as standard candles led to 
the discovery that the expansion of the universe is accelerating, implying 
that most of the energy of the universe is some sort of dark energy, with a 
ratio of pressure to density less than —1/3. This was confirmed by precise 
observations of the microwave background anisotropies, and by massive 
surveys of galaxies, which together provided increasingly accurate values 
for cosmological parameters. 

Meanwhile, the classic methods of astronomy have provided steadily 
improving independent constraints on the same cosmological parameters. 
The spectroscopic discovery of thorium and then uranium in the atmo- 
spheres of old stars, together with continued study of the turn-off from the 
main sequence in globular clusters, has narrowed estimates of the age of the 
universe. The measurement of the deuterium to hydrogen ratio in interstel- 
lar absorption combined with calculations of cosmological nucleosynthesis 
has given a good value for the cosmic density of ordinary baryonic mat- 
ter, and shown that it is only about a fifth of the density of some sort of 
mysterious non-baryonic cold dark matter. Observations with the Hubble 
Space Telescope as well as ground-based telescopes have given increasingly 
precise values for the Hubble constant. It is greatly reassuring that some of 
the parameters measured by these other means have values consistent with 
those found in studies of the cosmic microwave background and large scale 
structure. 

Progress continues. In the years to come, we can expect definite 
information about whether the dark energy density is constant or evolv- 
ing, and we hope for signs of gravitational radiation that would open the 
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era of inflation to observation. We may discover the nature of dark mat- 
ter, either by artificially producing dark matter particles at new large 
accelerators, or by direct observation of natural dark matter particles imping- 
ing on the earth. It remains to be seen if in our times fundamental physical 
theory can provide a specific theory of inflation or explain dark matter or 
dark energy. 

This new excitement in cosmology came as if on cue for elementary 
particle physicists. By the 1980s the Standard Model of elementary particles 
and fields had become well established. Although significant theoretical 
and experimental work continued, there was now little contact between 
experiment and new theoretical ideas, and without this contact, particle 
physics lost much of its liveliness. Cosmology now offered the excitement 
that particle physicists had experienced in the 1960s and 1970s. 

In 1999 I finished my three-volume book on the quantum theory of fields 
(cited here as “QTF”), and with unaccustomed time on my hands, I set 
myself the task of learning in detail the theory underlying the great progress 
in cosmology made in the previous two decades. Although I had done some 
research on cosmology in the past, getting up to date now turned out to take 
a fair amount of work. Review articles on cosmology gave good summaries 
of the data, but they often quoted formulas without giving the derivation, 
and sometimes even without giving a reference to the original derivation. 
Occasionally the formulas were wrong, and therefore extremely difficult 
for me to rederive. Where I could find the original references, the articles 
sometimes had gaps in their arguments, or relied on hidden assumptions, or 
used unexplained notation. Often massive computer programs had taken 
the place of analytic studies. In many cases I found that it was easiest to 
work out the relevant theory for myself. 

This book is the result. Its aim is to give self-contained explanations 
of the ideas and formulas that are used and tested in modern cosmological 
observations. The book divides into two parts, each of which in my exp- 
erience teaching the subject provides enough material for a one-semester 
graduate course. The first part, Chapters 1 through 4, deals chiefly with the 
isotropic and homogeneous average universe, with only a brief introduc- 
tion to the anisotropies in the microwave background in Section 2.6. These 
chapters are more-or-less in reverse chronological order; Chapter 1 concen- 
trates on the universe since the formation of galaxies, corresponding roughly 
to redshifts z < 10; Chapter 2 deals with the microwave background, emit- 
ted at a redshift z ~ 1,000; Chapter 3 describes the early universe, from 
the beginning of the radiation-dominated expansion to a redshift z ~ 104 
when the density of radiation fell below that of matter; and Chapter 4 takes 
up the period of inflation that is believed to have preceded the radiation- 
dominated era. The second part, Chapters 5 through 10, concentrates on 
the departures from the average universe. After some general formalism 
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in Chapter 5 and its application to the evolution of inhomogeneities in 
Chapter 6, I return in Chapter 7 to the microwave background anisotropies, 
and take up the large scale structure of matter in Chapter 8. Gravitational 
lensing is discussed late, in Chapter 9, because its most important cosmo- 
logical application may be in the use of weak lensing to study large scale 
structure. The treatment of inflation in Chapter 4 deals only with the aver- 
age properties of the universe in the inflationary era; I return to inflation in 
Chapter 10, which discusses the growth of inhomogeneities from quantum 
fluctuations during inflation. 

To the greatest extent possible, I have tried throughout this book to 
present analytic calculations of cosmological phenomena, and not just report 
results obtained elsewhere by numerical computation. The calculations that 
are used in the literature to compare observation with theory necessarily 
take many details into account, which either make an analytic treatment 
impossible, or obscure the main physical features of the calculation. Where 
this is the case, I have not hesitated to sacrifice some degree of accuracy 
for greater transparency. This is especially the case in the hydrodynam- 
ical treatment of cosmic fluctuations in Sections 6.2 through 6.5, and in 
the treatment of large scale structure in Chapter 8. But in Section 6.1 
and Appendix H I also give an account of the more accurate kinetic the- 
ory on which the modern cosmological computer codes are based. Both 
approaches are applied to the cosmic microwave back ground anisotropies in 
Chapter 7. 

So much has happened in cosmology since the 1960s that this book 
necessarily bears little resemblance to my 1972 treatise, Gravitation and 
Cosmology. On occasion I refer back to that book (cited here as “G&C”) 
for material that does not seem worth repeating here. Classical general 
relativity has not changed much since 1972 (apart from a great strengthen- 
ing of its experimental verification) so it did not seem necessary to cover 
gravitation as well as cosmology in the present book. However, as a conve- 
nience to readers who want to refresh their knowledge of general relativity, 
and to establish my notation, I provide a brief introduction to general rel- 
ativity in Appendix B. Other appendices deal with technical material that is 
needed here and there in the book. I have also supplied at the back of this 
book a glossary of symbols that are used in more than one section and an 
assortment of problems. 

In order to keep the book to manageable proportions, I decided to 
exclude material that was highly speculative. Thus this book does not go 
into cosmological theory in higher dimensions, or anthropic reasoning, or 
holographic cosmology, or conjectures about the details of inflation, or 
many other new ideas. I may perhaps include some of them in a follow- 
up volume. The present book is largely concerned with what has become 
mainstream cosmology: a scenario according to which inflation driven by 
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one or more scalar fields is followed by a big bang dominated by radiation, 
cold dark matter, baryonic matter, and vacuum energy. 

I believe that the discussion of topics that are treated in this book is up 
to date as of 200n, where n is an integer that varies from 1 to 7 through 
different parts of the book. I have tried to give full references to the relevant 
astrophysical literature up to these dates, but I have doubtless missed some 
articles. The mere absence of a literature reference should not be interpreted 
as a claim that the work presented is original, though perhaps some of it 
is. Where I knew them, I included references to postings in the Cornell 
archive, http: //arxiv.org, as well as to the published literature. In 
some cases I had to list only the Cornell archive number, where the article in 
question had not yet appeared in print, or where it had never been submit- 
ted to publication. I have quoted the latest measurements of cosmological 
parameters known to me, in part because I want to give the reader a sense 
of what is now observationally possible. But I have not tried to combine 
measurements from observations of different types, because I did not think 
that it would add any additional physical insight, and any such cosmological 
concordance would very soon be out of date. 

I owe a great debt to my colleagues at the University of Texas, includ- 
ing Thomas Barnes, Fritz Benedict, Willy Fischler, Karl Gebhardt, Patrick 
Greene, Richard Matzner, Paul Shapiro, Craig Wheeler, and especially 
Duane Dicus, who did some of the numerical calculations and supplied 
many corrections. Iam grateful above all among these colleagues to Eiichiro 
Komatsu, who read through a draft of the manuscript and was a never- 
failing source of insight and information about cosmological research. 
I received much help with figures and calculations from my research stu- 
dent Raphael Flauger, and I was warned of numerous errors by Flauger and 
other students: Yingyue Li Boretz, Kannokkuan Chaicherdsakul, Bo Li, 
Ian Roederer, and Yuki Watanabe. Matthew Anderson helped with numeri- 
cal calculations of cosmological nucleosynthesis. I have also benefited much 
from correspondence on special topics with Ed Bertschinger, Dick Bond, 
Latham Boyle, Robert Cahn, Alan Guth, Robert Kirshner, Andrei Linde, 
Eric Linder, Viatcheslav Mukhanov, Saul Perlmutter, Jonathan Pritchard, 
Adam Riess, Uros Seljak, Paul Steinhardt, Edwin Turner, and Matias 
Zaldarriaga. Thanks are also due to Jan Duffy and Terry Riley for many 
helps. Of course, I alone am responsible for any errors that may remain in the 
book. I hope that readers will let me know of any mistakes they may notice; 
I will post them on a web page, http://zippy.ph.utexas.edu/ 
~weinberg/corrections.html. 


Austin, Texas 
June 2007 
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Notation 


Latin indices i, j, k, and so on generally run over the three spatial coordinate 
labels, usually taken as 1, 2, 3. 


Greek indices u, v, etc. generally run over the four spacetime coordinate 
labels 1, 2, 3, 0, with x? the time coordinate. 


Repeated indices are generally summed, unless otherwise indicated. 


The flat spacetime metric n,» is diagonal, with elements n11 = n22 = N33 = 
1, noo = —1. 


Spatial three-vectors are indicated by letters in boldface. 


A hat over any vector indicates the corresponding unit vector: Thus, = 
v/v]. 
A dot over any quantity denotes the time-derivative of that quantity. 
2 32 32 
d(x!)2 + 3 (x2)? t a (x3)? 


V? is the Laplacian, 


Except on vectors and tensors, a subscript 0 denotes the present time. 


On densities, pressures, and velocities, the subscripts B, D, y, and v refer 
respectively to the baryonic plasma (nuclei plus electrons), cold dark matter, 
photons, and neutrinos, while the subscripts M and R refer respectively to 
non-relativistic matter (baryonic plasma plus cold dark matter) and radia- 
tion (photons plus neutrinos). 


The complex conjugate, transpose, and Hermitian adjoint of a matrix or 
vector A are denoted A*, AT, and At = A*!, respectively. +H.c. or +c.c. 
at the end of an equation indicates the addition of the Hermitian adjoint or 
complex conjugate of the foregoing terms. 


Beginning in Chapter 5, a bar over any symbol denotes its unperturbed 
value. 


In referring to wave numbers, q is used for co-moving wave numbers, with 
an arbitrary normalization of the Robertson—Walker scale factor a(t), while 
k is the present value q/aọ of the corresponding physical wave number 
q/a(t). (N.B. This differs from the common practice of using k for the 
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co-moving wave number, with varying conventions for the normalization 
of a(t).) 


Except where otherwise indicated, we use units with Å and the speed of light 
taken to be unity. Throughout —e is the rationalized charge of the electron, 
so that the fine structure constant is œ = e? /4m ~ 1/137. 


Numbers in parenthesis at the end of quoted numerical data give the 
uncertainty in the last digits of the quoted figure. 


For other symbols used in more than one section, see the Glossary of 
Symbols on page 565. 
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The Expansion of the Universe 


The visible universe seems the same in all directions around us, at least if 
we look out to distances larger than about 300 million light years.! The 
isotropy is much more precise (to about one part in 107°) in the cosmic 
microwave background, to be discussed in Chapters 2 and 7. As we will 
see there, this radiation has been traveling to us for about 14 billion years, 
supporting the conclusion that the universe at sufficiently large distances is 
nearly the same in all directions. 

It is difficult to imagine that we are in any special position in the universe, 
so we are led to conclude that the universe should appear isotropic to 
observers throughout the universe. But not to all observers. The universe 
does not seem at all isotropic to observers in a spacecraft whizzing through 
our galaxy at half the speed of light. Such observers will see starlight and 
the cosmic microwave radiation background coming toward them from the 
direction toward which they are moving with much higher intensity than 
from behind. In formulating the assumption of isotropy, one should spec- 
ify that the universe seems the same in all directions to a family of “typical” 
freely falling observers: those that move with the average velocity of typical 
galaxies in their respective neighborhoods. That is, conditions must be the 
same at the same time (with a suitable definition of time) at any points that 
can be carried into each other by a rotation about any typical galaxy. But any 
point can be carried into any other by a sequence of such rotations about var- 
ious typical galaxies, so the universe is then also homogeneous — observers 
in all typical galaxies at the same time see conditions pretty much the same.” 

The assumption that the universe is isotropic and homogeneous will 
lead us in Section 1.1 to choose the spacetime coordinate system so that the 
metric takes a simple form, first worked out by Friedmann? as a solution 
of the Einstein field equations, and then derived on the basis of isotropy 
and homogeneity alone by Robertson* and Walker.> Almost all of modern 
cosmology is based on this Robertson—Walker metric, at least as a first 


IK_K.S. Wu, O. Lahav, and M. J. Rees, Nature 397, 225 (January 21, 1999). For a contrary view, see 
P. H. Coleman, L. Pietronero, and R. H. Sanders, Astron. Astrophys. 200, L32 (1988): L. Pietronero, 
M. Montuori, and F. Sylos-Labini, in Critical Dialogues in Cosmology, (World Scientific, Singapore, 
1997): 24; F. Sylos-Labini, F. Montuori, and L. Pietronero, Phys. Rep. 293, 61 (1998). 

2The Sloan Digital Sky Survey provides evidence that the distribution of galaxies is homogeneous 
on scales larger than about 300 light years; see J. Yadav, S. Bharadwaj, B. Pandey, and T. R. Seshadri, 
Mon. Not. Roy. Astron. Soc. 364, 601 (2005) [astro-ph/05043 15). 

3A. Friedmann, Z. Phys. 10, 377 (1922); ibid. 21, 326 (1924). 

4H. P. Robertson, Astrophys. J. 82, 284 (1935); ibid., 83, 187, 257 (1936). 

5A. G. Walker, Proc. Lond. Math. Soc. (2) 42, 90 (1936). 


1 


1 The Expansion of the Universe 


approximation. The observational implications of these assumptions are 
discussed in Sections 1.2-1.4, without reference to any dynamical assump- 
tions. The Einstein field equations are applied to the Robertson—Walker 
metric in Section 1.5, and their consequences are then explored in 
Sections 1.6-1.13. 


1.1 Spacetime geometry 


As preparation for working out the spacetime metric, we first consider the 
geometry of a three-dimensional homogeneous and isotropic space. As 
discussed in Appendix B, geometry is encoded in a metric g(x) (with i 
and j running over the three coordinate directions), or equivalently in a line 
element ds? = gi dx! dx! , with summation over repeated indices understood. 
(We say that ds is the proper distance between x and x + dx, meaning that 
it is the distance measured by a surveyor who uses a coordinate system that 
is Cartesian in a small neighborhood of the point x.) One obvious homo- 
geneous isotropic three-dimensional space with positive definite lengths is 
flat space, with line element 


ds? = dx’. (1.1.1) 


The coordinate transformations that leave this invariant are here simply 
ordinary three-dimensional rotations and translations. Another fairly 
obvious possibility is a spherical surface in four-dimensional Euclidean 
space with some radius a, with line element 


d? = dx? +d, 2+x =a. (1.1.2) 


Here the transformations that leave the line element invariant are four- 
dimensional rotations; the direction of x can be changed to any other 
direction by a four-dimensional rotation that leaves z unchanged (that is, an 
ordinary three-dimensional rotation), while x can be carried into any other 
point by a four-dimensional rotation that does change z. It can be proved? 
that the only other possibility (up to a coordinate transformation) is a 
hyperspherical surface in four-dimensional pseudo-Euclidean space, with 
line element 


d? = dx? — d?, 2- x =a’, (1.1.3) 


where a? is (so far) an arbitrary positive constant. The coordinate trans- 
formations that leave this invariant are four-dimensional pseudo-rotations, 
just like Lorentz transformations, but with z instead of time. 


6See S. Weinberg, Gravitation and Cosmology (John Wiley & Sons, New York, 1972) [quoted below 
as G&C], Sec. 13.2. 


1.1 Spacetime geometry 
We can rescale coordinates 
x’ =ax, Z =. (1.1.4) 


Dropping primes, the line elements in the spherical and hyperspherical 
cases are 


ds* =a?|dx? tdh, ?2+x =l. (1.1.5) 


The differential of the equation z? + x? = 1 gives zdz = FX - dx so 


ds? = a°|dx +r Hs (1.1.6) 
We can extend this to the case of Euclidean space by writing it as 
2 2 2 (x - dx)? 
ds =a [ax +K py 5 (1.1.7) 
where 
+1 spherical 
K= 34 -l hyperspherical (1.1.8) 


0 Euclidean 


(The constant K is often written as k, but we will use upper case for this 
constant throughout this book to avoid confusion with the symbols for wave 
number or for a running spatial coordinate index.) Note that we must take 
a’ > 0 in order to have ds? positive at x = 0, and hence everywhere. 

There is an obvious way to extend this to the geometry of spacetime: just 
include a term (1.1.7) in the spacetime line element, with a now an arbitrary 
function of time (known as the Robertson—Walker scale factor): 


Oi: i 2 2 2 (x - dx)? 
dt” = —gyy(x)dx"dx” = dt” — a (t) [ax + . (1.1.9) 


Another theorem’ tells us that this is the unique metric (up to a coordinate 
transformation) if the universe appears spherically symmetric and isotropic 
to a set of freely falling observers, such as astronomers in typical galaxies. 
The components of the metric in these coordinates are: 

xixi 


g= 00 (3 +K a), gio =0, goo =-—l1, (1.1.10) 


TG&C, Sec. 13.5. 
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with i and j running over the values 1, 2, and 3, and with x? = t the time 
coordinate in our units, with the speed of light equal to unity. Instead of 
the quasi-Cartesian coordinates x’, we can use spherical polar coordinates, 
for which 


dx? =dr+rdQ2, d2=d6? + sin? 6d¢? . 
SO 


dr? 
a +0] . (1.1.11) 


dr? = dt? —a’(t) l 


in which case the metric becomes diagonal, with 
2 
a“ (t) ; 
Err = IL K2’ goo = a (t)r? > Epp = AOLA sin? 0 » go=—l. 
(1.1.12) 


We will see in Section 1.5 that the dynamical equations of cosmology 
depend on the overall normalization of the function a(t) only through a 
term K/a’(t), so for K = 0 this normalization has no significance; all that 
matters are the ratios of the values of a(t) at different times. 

The equation of motion of freely falling particles is given in Appendix B 
by Eq. (B.12): 


d? xt dx” dx“ 
— +r —— = 0, 1.1.13 
du? tiw du du ( ) 
where T$% is the affine connection, given in Appendix B by Eq. (B.13), 


1 pr Ee IZ = 


pe = . 
w =E | gx Ba. aA 


(1.1.14) 


and u is a suitable variable parameterizing positions along the spacetime 
curve, proportional to t for massive particles. (A spacetime path x” = x (u) 
satisfying Eq. (1.1.13) is said to be a geodesic, meaning that the integral 
J dt is stationary under any infinitesimal variation of the path that leaves 
the endpoints fixed.) Note in particular that the derivatives djg99 and ġo; 
vanish, so I’ = 0. A particle at rest in these coordinates will therefore stay 
at rest, so these are co-moving coordinates, which follow the motion of typical 
observers. Because goo = —1, the proper time interval (— Suvdxdx”)'/ 2 for 
a co-moving clock is just dt, so t is the time measured in the rest frame of a 
co-moving clock. 

The meaning of the Robertson—Walker scale factor a(t) can be clarified 
by calculating the proper distance at time ¢ from the origin to a co-moving 
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object at radial coordinate r: 


sin™!r K=41 
=a(t)x 4 sinh™!r K = -—1 (1.1.15) 


r č dr 
d(r,t)=a(t ———— 
o=00 | Te a R 


In this coordinate system a co-moving object has r time-independent, so 
the proper distance from us to a co-moving object increases (or decreases) 
with a(t). Since there is nothing special about our own position, the proper 
distance between any two co-moving observers anywhere in the universe 
must also be proportional to a(t). The rate of change of any such proper 
distance d (t) is just 


d=da/a. (1.1.16) 


We will see in the following section that in fact a(t) is increasing. 
We also need the non-zero components of the affine connection, given 
by Eq. (1.1.14) as: 


0 l , x!x/ 
r} = -5 (soi + goji — gijo) = aa | 5 + K—— 


1 — Kx? 
= aagi , (1.1.17) 
ry = 78 g” (sij + 89.0 — 80) i) = 7 > (1.1.18) 
; lom (3Em Iim Əy Be 
I im jm m j i 


Here gj and Pi are the purely spatial metric and affine connection, and g!/ 


is the Seite of the 3 x 3 matrix gj, which in general is different from the 
ij component of the reciprocal of the 4 x 4 matrix guv. In quasi-Cartesian 
coordinates, 


xix 
1 — Kx?’ 
We can use these components of the affine connection to find the motion 
of a particle that is not at rest in the co-moving coordinate system. First, 


let’s calculate the rate of change of the momentum of a particle of non-zero 
mass mo. Consider the quantity 


dx! dx/ 
P = mey (1.1.21) 


where dt? = dt? — gjdx'dx/. In a locally inertial Cartesian coordinate 
system, for which gj = 47, we have dt = dtv/1 — v? where vi = dx'/dt, 


Sek Pi = K gx" . (1.1.20) 
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so Eq. (1.1.21) is the formula given by special relativity for the magnitude 
of the momentum. On the other hand, the quantity (1.1.21) is evidently 
invariant under arbitrary changes in the spatial coordinates, so we can eval- 
uate it just as well in co-moving Robertson—Walker coordinates. This can 
be done directly, using Eq. (1.1.13), but to save work, suppose we adopt a 
spatial coordinate system in which the particle position is near the origin 
x! = 0, where & = 5, + O(x’), and we can therefore ignore the purely 
spatial components BA of the affine connection. General relativity gives 
the equation of motion 


dx ee) dx" dx” _ _2da dx! dt 
dr? = "dr dr adtdtdt’ 

Multiplying with dt/dt gives 

d (dx! _ 2dadxi 

dt\dt) adtdt’ 


whose solution is 


dx! 1 
— a =. 1.1.22 
dr ~ a? (t) ( ) 
Using this in Eq. (1.1.21) with a metric gj; = a° (t)ô;j, we see that 
P(t) x l/a(t). (1.1.23) 


This holds for any non-zero mass, however small it may be compared to 
the momentum. Hence, although for photons both mo and dt vanish, 
Eq. (1.1.23) is still valid. 

It is important to characterize the paths of photons and material particles 
in interpreting astronomical observations (especially of gravitational lenses, 
in Chapter 9). Photons and particles passing through the origin of our 
spatial coordinate system obviously travel on straight lines in this coordinate 
system, which are spatial geodesics, curves that satisfy the condition 


dxi dd dx! 
=0, 1.1.24 
ds? Te ds ds ( ) 
where ds is the three-dimensional proper length 
ds? = Žij dx! dx! . (1.1.25) 


But the property of being a geodesic is invariant under coordinate transfor- 
mations (since it states the vanishing of a vector), so the path of the photon 
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or particle will also be a spatial geodesic in any spatial coordinate system, 
including those in which the photon or particle’s path does not pass through 
the origin. (This can be seen in detail as follows. Using Eqs. (1.1.17) and 
(1.1.18), the equations of motion (1.1.13) ofa photon or material particle are 


d2x! . dxi dx! 2a dx’ dt 
0= E N 1.1.26 
du? J du du a du du ( ) 

2 i qi 
pO eg o S (1.1.27) 


Eq. (1.1.26) can be written 


o=(*) dxi pidi dx! | |ds 2a dt ds | dx’ (1.1.28) 
~~ \ du ds? I ds ds du adudu| d? `` ` 


where s is so far arbitrary. If we take s to be the proper length (1.1.25) in 
the spatial geometry, then as we have seen 


du? x dt? x dt — a? ds? 


Dividing by du?, differentiating with respect to u, and using Eq. (1.1.27) 
shows that 


d?s 2àdt ds 


du? i a dudu  ’ 
so that Eq. (1.1.28) gives Eq. (1.1.24).) 

There are various smoothed-out vector and tensor fields, like the current 
of galaxies and the energy-momentum tensor, whose mean values satisfy the 
requirements of isotropy and homogeneity. Isotropy requires that the mean 
value of any three-vector vê must vanish, and homogeneity requires the 
mean value of any three-scalar (that is, a quantity invariant under purely 
spatial coordinate transformations) to be a function only of time, so the 
current of galaxies, baryons, etc. has components 


Ji=0, V =n), (1.1.29) 


with n(t) the number of galaxies, baryons, etc. per proper volume in a co- 
moving frame of reference. If this is conserved, in the sense of Eq. (B.38), 
then 


oJ" dn dn dan 
0=J".,, = — 4+T# J" = — 4+Tin = — 4+3—- 
= Bye t w ma emer ae A 
so 
constant 
j= nw, 1.1.30 
nD = O (1.1.30) 
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This shows the decrease of number densities due to the expansion of the 
co-moving coordinate mesh for increasing a(t). 

Likewise, isotropy requires the mean value of any three-tensor tË atx = 0 
to be proportional to 6; and hence to g}, which equals a~*5; at x = 0. 
Homogeneity requires the proportionality coefficient to be some function 
only of time. Since this is a proportionality between two three-tensors tË and 
gÏ it must remain unaffected by an arbitrary transformation of space coor- 
dinates, including those transformations that preserve the form of g! while 
taking the origin into any other point. Hence homogeneity and isotropy 
require the components of the energy-momentum tensor everywhere to take 
the form 


T= pith, T” =0, TI=8*(R)a?rOpM. (1.1.31) 


(These are the conventional definitions of proper energy density p and pres- 
sure p, as given by Eq. (B.43) in the case of a velocity four-vector with u’ = 0, 
u? = 1.) The momentum conservation law T iu., = 0 is automatically sat- 
isfied for the Robertson—Walker metric and the energy-momentum tensor 
(1.1.31), but the energy conservation law gives the useful information 


o= 7%, = $707 4 Pe T” 
= oe erin or “ Hoto), 
so that 
P+ (n+p) =0. (1.1.32) 


This can easily be solved for an equation of state of the form 
p=woe (1.1.33) 
with w time-independent. In this case, Eq. (1.1.32) gives 
pear (1.1.34) 
In particular, this applies in three frequently encountered extreme cases: 
e Cold Matter (e.g. dust): p = 0 
p xa”? (1.1.35) 


¢ Hot Matter (e.g. radiation): p = p/3 
paxa (1.1.36) 
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¢ Vacuum energy: As we will see in Section 1.5, there is another kind of 
energy-momentum tensor, for which T“” œ g"”, so that p = —p, in 
which case the solution of Eq. (1.1.32) is that p isa constant, known (up 
to conventional numerical factors) either as the cosmological constant 
or the vacuum energy. 


These results apply separately for coexisting cold matter, hot matter, and 
a cosmological constant, provided that there is no interchange of energy 
between the different components. They will be used together with the 
Einstein field equations to work out the dynamics of the cosmic expansion 
in Section 1.5. 

So far, we have considered only local properties of the spacetime. Now 
let us look at it in the large. For K = +1 space is finite, though like any 
spherical surface it has no boundary. The coordinate system used to derive 
Eq. (1.1.7) with K = +1 only covers half the space, with z > 0, in the same 
way that a polar projection map of the earth can show only one hemisphere. 
Taking account of the fact that z can have either sign, the circumference of 
the space is 27ra, and its volume is 2n7a3. 

The spaces with K = 0 or K = —1 are usually taken to be infinite, but 
there are other possibilities. It is also possible to have finite spaces with 
the same local geometry, constructed by imposing suitable conditions of 
periodicity. For instance, in the case K = 0 we might identify the points 
x and x + m Ly + mL + 1313, where nı, n2, n3 run over all integers, and 
Lı, L2, and L; are fixed non-coplanar three-vectors that characterize the 
space. This space is then finite, with volume aL, - (L2 x L3). Looking out 
far enough, we should see the same patterns of the distribution of matter 
and radiation in opposite directions. There is no sign of this in the observed 
distribution of galaxies or cosmic microwave background fluctuations, so 
any periodicity lengths such as |L;| must be larger than about 10!° light 
years.8 

Therearean infinite number of possible periodicity conditions for K = —1 
as well as for K = +1 and K = 0.? We will not consider these possibilities 
further here, because they seem ill-motivated. In imposing conditions of 
periodicity we give up the rotational (though not translational) symmetry 
that led to the Robertson—Walker metric in the first place, so there seems 
little reason to impose these periodicity conditions while limiting the local 
spacetime geometry to that described by the Robertson—Walker metric. 


8N. J. Cornish et al., Phys. Rev. Lett. 92, 201302 (2004); N. G. Phillips & A. Kogut, Astrophys. J. 
545, 820 (2006) [astro-ph/0404400]. 

°For reviews of this subject, see G. F. R. Ellis, Gen. Rel. & Grav. 2, 7 (1971); M. Lachiéze-Rey 
and J.-P. Luminet, Phys. Rept. 254, 135 (1995); M. J. Rebouças, in Proceedings of the Xth Brazilian 
School of Cosmology and Gravitation, eds. M. Novello and S. E. Perez Bergliaffa (American Institute 
of Physics Conference Proceedings, Vol. 782, New York, 2005): 188 [astro-ph/0504365]. 


1 The Expansion of the Universe 
1.2 The cosmological redshift 


The general arguments of the previous section gave no indication whether 
the scale factor a(t) in the Robertson—Walker metric (1.1.9) is increasing, 
decreasing, or constant. This information comes to us from the observa- 
tion of a shift in the frequencies of spectral lines from distant galaxies as 
compared with their values observed in terrestrial laboratories. 

To calculate these frequency shifts, let us adopt a Robertson—Walker 
coordinate system in which we are at the center of coordinates, and consider 
a light ray coming to us along the radial direction. A ray of light obeys the 
equation dt” = 0, so for such a light ray Eq. (1.1.11) gives 


dt = ta(t) (1.2.1) 


dr 
V1 — Kr? 
For a light ray coming toward the origin from a distant source, r decreases 
as ¢ increases, so we must choose the minus sign in Eq. (1.2.1). Hence if 
light leaves a source at co-moving coordinate rı at time 1), it arrives at the 
origin r = 0 at a later time fg, given by 


t dt ri d 
J —-= | sz (1.2.2) 
ty a(t) 0 yl1— Kr2 
Taking the differential of this relation, and recalling that the radial coord- 
inate rı of co-moving sources is time-independent, we see that the interval 
ôtı between departure of subsequent light signals is related to the interval 
ôto between arrivals of these light signals by 
ot} 7 dto 
a(t) a(to) 


If the “signals” are subsequent wave crests, the emitted frequency is vy = 
1/dt,, and the observed frequency is vo = 1/ôto, so 


vo/vı = a(tı)/a(to) . (1.2.4) 


(1.2.3) 


Ifa(t) is increasing, then this is a redshift, a decrease in frequency by a factor 
a(tı)/a(tọ), equivalent to an increase in wavelength by a factor convention- 
ally called 1 + z: 


1 +z = a(tọ)/a(tı) . (1.2.5) 


Alternatively, if a(t) is decreasing then we have a blueshift, a decrease in 
wavelength given by the factor Eq. (1.2.5) with z negative. These results are 
frequently interpreted in terms of the familiar Doppler effect; Eq. (1.1.15) 
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shows that for an increasing or decreasing a(t), the proper distance to any co- 
moving source of light like a typical galaxy increases or decreases with time, 
so that such sources are receding from us or approaching us, which naturally 
produces a redshift or blueshift. For this reason, galaxies with redshift (or 
blueshift) z are often said to have a cosmological radial velocity cz. (The 
meaning of relative velocity is clear only for z « 1, so the existence of 
distant sources with z > 1 does not imply any violation of special relativity.) 
However, the interpretation of the cosmological redshift as a Doppler shift 
can only take us so far. In particular, the increase of wavelength from 
emission to absorption of light does not depend on the rate of change of 
a(t) at the times of emission or absorption, but on the increase of a(t) in the 
whole period from emission to absorption. 

We can also understand the frequency shift (1.2.4) by reference to the 
quantum theory of light: The momentum of a photon of frequency v is 
hv/c (where / is Planck’s constant), and we saw in the previous section that 
this momentum varies as | /a(f). 

For nearby sources, we may expand a(t) in a power series, so 


a(t) ~ alto) [1+ (t — to) Ho +...] (1.2.6) 
where Ho is a coefficient known as the Hubble constant: 
Ho = a(to)/a(to) . (1.2.7) 
Eq. (1.2.5) then gives the fractional increase in wavelength as 
z = Ho (to -—ti)+.... (1.2.8) 


Note that for close objects, tọ — tı is the proper distance d (in units with 
c = 1). We therefore expect a redshift (for Ho > 0) or blueshift (for Ho < 0) 
that increases linearly with the proper distance d for galaxies close enough 
to use the approximation (1.2.6): 


z= Hod +... (1.2.9) 


The redshift of light from other galaxies was first observed in the 1910s 
by Vesto Melvin Slipher at the Lowell Observatory in Flagstaff, Arizona. 
In 1922, he listed 41 spiral nebulae, of which 36 had positive z up to 0.006, 
and only 5 had negative z, the most negative being the Andromeda nebula 
M31, with z = —0.001.! From 1918 to 1925 C. Wirtz and K. Lundmark? 


Iv M. Slipher, table prepared for A. S. Eddington, The Mathematical Theory of 
Relativity, 2nd ed. (Cambridge University Press, London, 1924): 162. 

2C, Wirtz, Astr. Nachr. 206, 109 (1918); ibid. 215, 349 (1921); ibid. 216, 451 (1922); ibid. 222, 21 
(1924); Scientia 38, 303 (1925); K. Lundmark, Stock. Hand. 50, No. 8 (1920); Mon. Not. Roy. Astron. 
Soc. 84, 747 (1924); ibid. 85, 865 (1925). 
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discovered a number of spiral nebulae with redshifts that seemed to increase 
with distance. But until 1923 it was only possible to infer the relative dis- 
tances of the spiral nebulae, using observations of their apparent luminosity 
or angular diameter. With the absolute luminosity and physical dimensions 
unknown, it was even possible that the spiral nebulae were outlying parts 
of our own galaxy, as was in fact believed by many astronomers. Edwin 
Hubble’s 1923 discovery of Cepheid variable stars in the Andromeda neb- 
ula M31 (discussed in the next section) allowed him to estimate its distance 
and size, and made it clear that the spiral nebulae are galaxies like our own, 
rather than objects in our own galaxy. 

No clear linear relation between redshift and distance could be seen in 
the early data of Slipher, Wirtz, and Lundmark, because of a problem that 
has continued to bedevil measurements of the Hubble constant down to 
the present. Real galaxies generally do not move only with the general 
expansion or contraction of the universe; they typically have additional 
“peculiar” velocities of hundreds of kilometers per second, caused by grav- 
itational fields of neighboring galaxies and intergalactic matter. To see a 
linear relation between redshift and distance, it is necessary to study galax- 
ies with |z| >> 1073, whose cosmological velocities ze are thousands of 
kilometers per second. 

In 1929 Hubble? announced that he had found a “roughly linear” relation 
between redshift and distance. But at that time redshifts and distances 
had been measured only for galaxies out to the large cluster of galaxies in 
the constellation Virgo, whose redshift indicates a radial velocity of about 
1,000 km/sec, not much larger than typical peculiar velocities. His data 
points were therefore spread out widely in a plot of redshift versus distance, 
and did not really support a linear relation. But by the early 1930s he 
had measured redshifts and distances out to the Coma cluster, with redshift 
z ~ 0.02, corresponding to a recessional velocity of about 7,000 km/sec, and 
a linear relation between redshift and distance was evident. The conclusion 
was clear (at least, to some cosmologists): the universe really is expand- 
ing. The correctness of this interpretation of the redshift is supported by 
observations to be discussed in Section 1.7. 

From Hubble’s time to the present galaxies have been discovered with 
ever larger redshifts. Galaxies were found with redshifts of order unity, 
for which expansions such as Eq. (1.2.9) are useless, and we need formulas 
that take relativistic effects into account, as discussed in Sections 1.4 and 
1.5. At the time of writing, the largest accurately measured redshift is for 
a galaxy observed with the Subaru telescope.*t The Lyman alpha line from 


3E. P. Hubble, Proc. Nat. Acad. Sci. 15, 168 (1929). 
4M. Iye et al., Nature 443, 186 (2006) [astro-ph/0609393]. 
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this galaxy (emitted in the transition from the 2p to 1s levels of hydrogen), 
which is normally at an ultraviolet wavelength of 1,215 A, is observed at the 
infrared wavelength of 9,682 A, indicating a redshift 1 + z = 9682/1215, or 
z = 6.96. 

It may eventually become possible to measure the expansion rate H (t) = 
a(t)/a(t) at times t earlier than the present, by observing the change in very 
accurately measured redshifts of individual galaxies over times as short as 
a decade.” By differentiating Eq. (1.2.5) we see that the rate of change of 
redshift with the time of observation is 


dz _ (to) alto) a(t1) dti 
dto a(t) a? (tı) dto 


d 
: [Ho — Hos | (+z). 


From the same argument that led to Eq. (1.2.3) we have dt /dto = 1/(1 +2), 
so if we measure dz/dto we can find the expansion rate at the time of light 
emission from the formula 


H(t) = Wo +2) — Z. (1.2.10) 


1.3 Distances at small redshift: The Hubble constant 


We must now think about how astronomical distances are measured. In 
this section we will be considering objects that are relatively close, say 
with z not much greater than 0.1, so that effects of the spacetime curva- 
ture and cosmic expansion on distance determinations can be neglected. 
These measurements are of cosmological importance in themselves, as they 
are used to learn the value of the Hubble constant Ho. Also, distance 
measurements at larger redshift, which are used to find the shape of the 
function a(t), rely on the observations of “standard candles,” objects of 
known intrinsic luminosity, that must be identified and calibrated by stud- 
ies at these relatively small redshifts. Distance determinations at larger 
redshift will be discussed in Section 1.6, after we have had a chance to lay 
a foundation in Sections 1.4 and 1.5 for an analysis of the effects of expan- 
sion and spacetime geometry on measurements of distances of very distant 
objects. 

It is conventional these days to separate the objects used to measure dis- 
tances in cosmology into primary and secondary distance indicators. The 
absolute luminosities of the primary distance indicators in our local group 


5A. Loeb, Astrophys. J. 499, L111 (1998) [astro-ph/9802122]; P-S. Corasaniti, D. Huterer, and 
A. Melchiorri, Phys. Rev. D 75, 062001 (2007) [astro-ph/0701433]. For an earlier suggestion along this 
line, see A. Sandage, Astrophys. J. 139, 319 (1962). 
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of galaxies are measured either directly, by kinematic methods that do not 
depend on an a priori knowledge of absolute luminosities, or indirectly, by 
observation of primary distance indicators in association with other primary 
distance indicators whose distance is measured by kinematic methods. The 
sample of these relatively close primary distance indicators is large enough 
to make it possible to work out empirical rules that give their absolute 
luminosities as functions of various observable properties. Unfortunately, 
the primary distance indicators are not bright enough for them to be stud- 
ied at distances at which z is greater than about 0.01, redshifts at which 
cosmological velocities cz would be greater than typical random depar- 
tures of galactic velocities from the cosmological expansion, a few hundred 
kilometers per second. Thus they cannot be used directly to learn about 
a(t). For this purpose it is necessary to use secondary distance indicators, 
which are bright enough to be studied at these large distances, and whose 
absolute luminosities are known through the association of the closer ones 
with primary distance indicators. 


A. Primary distance indicators! 


Almost all distance measurements in astronomy are ultimately based on 
measurements of the distance of objects within our own galaxy, using one 
or the other of two classic kinematic methods. 


1. Trigonometric parallax 

The motion of the earth around the sun produces an annual motion of the 
apparent position of any star around an ellipse, whose maximum angular 
radius z is given in radians (for z < 1, which is the case for all stars) by 


d 


where d is the star’s distance from the solar system, and dg is the mean dis- 
tance of the earth from the sun,? defined as the astronomical unit, 


x (1.3.1) 


lFor a survey, see M. Feast, in Nearby Large-Scale Structures and the Zone of Avoidance, eds. 

A. P. Fairall and P. Woudt (ASP Conference Series, San Francisco, 2005) [astro-ph/0405440]. 
2The history of measurements of distances in the solar system goes back to Aristarchus of Samos 
(circa 310 BC-230 BC). From the ratio of the breadth of the earth’s shadow during a lunar eclipse to the 
angular diameter of the moon he estimated the ratio of the diameters of the moon and earth; from the 
angular diameter of the moon he estimated the ratio of the diameter of the moon to its distance from 
the earth; and from the angle between the lines of sight to the sun and moon when the moon is half full 
he estimated the ratio of the distances to the sun and moon; and in this way he was able to measure the 
distance to the sun in units of the diameter of the earth. Although the method of Aristarchus was correct, 
his observations were poor, and his result for the distance to the sun was far too low. [For an account 
of Greek astronomy before Aristarchus and a translation of his work, see T. L. Heath, Aristarchus of 
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1 AU = 1.496 x 108 km. A parsec (pc) is defined as the distance at which 
x = 1"; there are 206,264.8 seconds of arc per radian so 


1 pe = 206,264.8 AU = 3.0856 x 10! km = 3.2616 light years. 


The parallax in seconds of arc is the reciprocal of the distance in parsecs. 

The first stars to have their distances found by measurement of their 
trigonometric parallax were a Centauri, by Thomas Henderson in 1832, 
and 61 Cygni, by Friedrich Wilhelm Bessel in 1838. These stars are at 
distances 1.35 pc and 3.48 pc, respectively. The earth’s atmosphere makes 
it very difficult to measure trigonometric parallaxes less than about 0.03” 
from ground-based telescopes, so that for many years this method could 
be used to find the distances of stars only out to about 30 pc, and at these 
distances only for a few stars and with poor accuracy. 

This situation has been improved by the launching of a European Space 
Agency satellite known as Hipparcos, used to measure the apparent pos- 
itions and luminosities of large numbers of stars in our galaxy.? For stars 
of sufficient brightness, parallaxes could be measured with an accuracy 
(standard deviation) in the range of 7 to 9 x10~4 arc seconds. Of the 
118,000 stars in the Hipparcos Catalog, it was possible in this way to find 
distances with a claimed uncertainty of no more than 10% for about 20,000 
stars, some at distances over 100 pc. 


2. Proper motions 

A light source at a distance d with velocity v} perpendicular to the line 
of sight will appear to move across the sky at a rate u in radians/time 
given by 


u=v/d. (1.3.2) 


This is known as its proper motion. Of course, astronomers generally have no 
way of directly measuring the transverse velocity v1, but they can measure 
the component v, of velocity along the line of sight from the Doppler shift 
of the source’s spectral lines. The problem is to infer v, from the measured 
value of v,. This can be done in a variety of special cases: 


e Moving clusters are clusters of stars that were formed together and 
hence move on parallel tracks with equal speed. (These are open 


Samos (Oxford University Press, Oxford, 1913).] The first reasonably accurate determination of the 
distance of the earth to the sun was made by the measurement of a parallax. In 1672 Jean Richer and 
Giovanni Domenico Cassini measured the distance from the earth to Mars, from which it was possible 
to infer the distance from the earth to the sun, by observing the difference in the apparent direction to 
Mars as seen from Paris and Cayenne, which are separated by a known distance of 6,000 miles. Today 
distances within the solar system are measured very accurately by measurement of the timing of radar 
echoes from planets and of radio signals from transponders carried by spacecraft. 
3M. A.C. Perryman et al., Astron. Astrophys. 323, L49 (1997). 
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clusters, in the sense that they are not held together by gravitational 
attraction, in distinction to the much larger globular clusters whose 
spherical shape indicates a gravitationally bound system.) The most 
important such cluster is the Hyades (called by Tennyson’s Ulysses the 
“rainy Hyades”), which contains over 100 stars. The velocities of these 
stars along the line of sight are measured from their Doppler shifts, 
and if we knew the distance to the cluster then the velocities of its 
stars at right angles to the line of sight could be measured from their 
proper motions. The distance to the cluster was determined long ago 
to be about 40 pc by imposing the constraint that all these velocities 
are parallel. Distances measured in this way are often expressed as 
moving cluster parallaxes. Since the advent of the Hipparcos satellite, 
the moving cluster method has been supplemented with a direct mea- 
surement of the trigonometric parallax of some of these clusters, 
including the Hyades. 


e 


A second method is based on the statistical analysis of the Doppler 
shifts and proper motions of stars in a sample whose relative 
distances are all known, for instance because they all have the same 
(unknown) absolute luminosity, or because they all at the same 
(unknown) distance. The Doppler shifts give the velocities along 
the line of sight, and the proper motions and the relative distances 
give the velocities transverse to the line of sight, up to a single overall 
factor related to the unknown absolute luminosity or distance. This 
factor can be determined by requiring that the distribution of veloc- 
ities transverse to the line of sight is the same as the distribution of 
velocities along the line of sight. Distances measured in this way are 
often called statistical parallaxes, or dynamical distances. 


e 


The distance to the Cepheid variable star ¢ Geminorum has been 
measured* by comparing the rate of change of its physical diame- 
ter, as found from the Doppler effect, with the rate of change of its 
angular diameter, measured using an optical interferometer. (About 
Cepheids, more below.) The distance was found to be 336 + 44 pc, 
much greater than could have been found from a trigonometric par- 
allax. This method has subsequently been extended to eight other 
Cepheids.» 


e It is becoming possible to measure distances by measuring the 
proper motion of the material produced by supernovae, assuming a 


4B. F. Lane, M. J. Kuchner, A. F. Boden, M. Creech-Eakman, and S. B. Kulkarni, Nature 407, 485 
(September 28, 2000). 
5P, Kervella et al., Astron. Astrophys. 423, 327 (2004) [astro-ph/0404179]. 
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more-or-less cylindrically symmetric explosion, so that the transverse 
velocity vı can be inferred from the radial velocity v, measured by 
Doppler shifts. This method has been applied® to the ring around 
the supernova SN1987A, observed in 1987 in the Large Magellanic 
Cloud, with the result that its distance is 52 kpc (thousand parsecs). 


e 


The measurement of the time-varying Doppler shift and proper motion 
of an object in orbit around a central mass can be used to find the dis- 
tance to the object. For instance, if the line of sight happens to be in 
the plane of the orbit, and if the orbit is circular, then the Doppler 
shift is a maximum when the object is moving along the line of sight, 
and hence gives the orbital velocity v, while the proper motion u is a 
maximum when the object is moving with the same velocity at right 
angles to the line of sight, and gives the distance as v/a. This method 
can also be used for orbits that are inclined to the line of sight and not 
circular, by studying the time-variation of the Doppler shift and proper 
motion. The application of this method to the star S2, which orbits 
the massive black hole in the galactic center, gives what is now the best 
value for the distance of the solar system from the galactic center,’ 
as 8.0 + 0.4 kpc. This method also allows the measurement of some 
distances outside our galaxy, by using the motion of masers — point 
microwave sources — in the accretion disks of gas and dust in orbit 
around black holes at the centers of galaxies. The orbital velocity can 
be judged from the Doppler shifts of masers at the edge of the accretion 
disk, which are moving directly toward us or away from us, and if this 
is the same as the orbital velocities of masers moving transversely to 
the line of sight, then the ratio of this orbital velocity to their observed 
proper motion gives the distance to the galaxy. So far, this method 
has been used to measure the distance to the galaxy NGC 4258,8 as 
7.2 + 0.5 Mpc (million parsecs), and to the galaxy M33,” as 0.730 + 
0.168 Mpc. 


These kinematic methods have limited utility outside the solar neighbor- 
hood. We need a different method to measure larger distances. 


3. Apparent luminosity 
The most common method of determining distances in cosmology is based 
on the measurement of the apparent luminosity of objects of known (or 


6N. Panagia, Mem. Soc. Astron. Italiana 69, 225 (1998). 

TR. Eisenhauer et al., Astrophys. J. Lett. 597, L121 (2003) [astro-ph/0306220]. 

8J, Herrnstein et al., Nature 400, 539 (3 August 1999). 

9A, Brunthaler, M. J. Reid, H. Falcke, L. J. Greenhill, and C. Henkel, Science 307, 1440 (2005) 
[astro-ph/0503058]. 
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supposedly known) absolute luminosity. The absolute luminosity L is the 
energy emitted per second, and the apparent luminosity £ is the energy 
received per second per square centimeter of receiving area. If the energy 
is emitted isotropically, then we can find the relation between the absolute 
and apparent luminosity in Euclidean geometry by imagining the luminous 
object to be surrounded with a sphere whose radius is equal to the distance 
d between the object and the earth. The total energy per second passing 
through the sphere is 47 d?°£, so 


pe L 
And?” 


This relation is subject to corrections due to interstellar and/or intergalac- 
tic absorption, as well as possible anisotropy of the source, which though 
important in practice involve too many technicalities to go into here. 

Astronomers unfortunately use a traditional notation for apparent and 
absolute luminosity in terms of apparent and absolute magnitude.'° In 
the second century A.D., the Alexandrian astronomer Claudius Ptolemy 
published a list of 1,022 stars, labeled by categories of apparent brightness, 
with bright stars classed as being of first magnitude, and stars just barely 
visible being of sixth magnitude.!! This traditional brightness scale was 
made quantitative in 1856 by Norman Pogson, who decreed that a difference 
of five magnitudes should correspond to a ratio of a factor 100 in apparent 
luminosities, so that £ œ 1077”/°. With the advent of photocells at the 
beginning of the twentieth century, it became possible to fix the constant 
of proportionality: the apparent bolometric luminosity (that is, including 
all wavelengths) is given in terms of the apparent bolometric magnitude 
m by 


(1.3.3) 


€=1077"/> x 2.52 x 107% erg cm~? s7! . (1.3.4) 


For orientation, Sirius has a visual magnitude mMyis = — 1.44, the Andromeda 
nebula M31 has m,j, = 0.1, and the large galaxy M87 in the nearest large 
cluster of galaxies has Myis = 8.9. The absolute magnitude in any wave- 
length band is defined as the apparent magnitude an object would have at 
a distance of 10 pc, so that the absolute bolometric luminosity is given in 
terms of the absolute bolometric magnitude M by 


L = 107?” x 3.02 x 10° erg s7! . (1.3.5) 


10For the history of the apparent magnitude scale, see J. B. Hearnshaw, The Meas- 
urement of Starlight: Two centuries of astronomical photometry (Cambridge University Press, 
Cambridge, 1996); K. Krisciunas, astro-ph/0106313. 

1 For the star catalog of Ptolemy, see M. R. Cohen and I. E. Drabkin, A Source Book in Greek Science 
(Harvard University Press, Cambridge, MA, 1948): p. 131. 
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For comparison, in the visual wavelength band the absolute magnitude Myis 
is +4.82 for the sun, +1.45 for Sirius, and —20.3 for our galaxy. Eq. (1.3.3) 
may be written as a formula for the distance in terms of the distance-modulus 
m—M: 


d = 101M Pp, | (1.3.6) 


There are several different kinds of star that have been used in measure- 
ments of distance through the observation of apparent luminosity: 


e Main Sequence: Stars that are still burning hydrogen at their cores 
obey a characteristic relation between absolute luminosity and color, 
both depending on mass. This is known as the main sequence, discov- 
ered in the decade before the First World War by Ejnar Hertzsprung 
and Henry Norris Russell. The luminosity is greatest for blue-white 
stars, and then steadily decreases for colors tending toward yellow and 
red. The shape of the main sequence is found by observing the appar- 
ent luminosities and colors of large numbers of stars in clusters, all of 
which in each cluster may be assumed to be at the same distance from 
us, but we need to know the distances to the clusters to calibrate abso- 
lute luminosities on the main sequence. For many years the calibration 
of the main sequence absolute luminosities was based on observation 
of a hundred or so main sequence stars in the Hyades cluster, whose 
distance was measured by the moving cluster method described above. 
With the advent of the Hipparcos satellite, the calibration of the main 
sequence has been greatly improved through the observation of col- 
ors and apparent luminosities of nearly 100,000 main sequence stars 
whose distance is known through measurement of their trigonomet- 
ric parallax. Including in this sample are stars in open clusters such 
as the Hyades, Praesepe, the Pleiades, and NGC 2516; these clusters 
yield consistent main sequence absolute magnitudes if care is taken 
to take proper account of the varying chemical compositions of the 
stars in different clusters.!™? With the main sequence calibrated in 
this way, we can use Eq. (1.3.3) to measure the distance of any star 
cluster or galaxy in which it is possible to observe stars exhibiting 
the main sequence relation between apparent luminosity and color. 
Distances measured in this way are sometimes known as photometric 
parallaxes. 

The analysis of the Hipparcos parallax measurements revealed a dis- 


crepancy between the distances to the Pleiades star cluster measured by 
observations of main sequence stars and by measurements of 


125, M. Percival, M. Salaris, and D. Kilkenny, Astron. Astrophys. 400, 541 (2003) [astro-ph/0301219]. 
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trigonometric parallax.!3 The traditional method, using a main 


sequence calibration based on the application of the moving cluster 
method to the closer Hyades cluster, gave a distance to the Pleiades!4 
of 132 + 4 pc. Then trigonometric parallaxes of a number of stars in 
the Pleiades measured by the Hipparcos satellite gave a distance!> of 
118 + 4 pc, in contradiction with the results of main sequence fitting. 
More recently, these Hipparcos parallaxes have been contradicted by 
more accurate measurements of the parallaxes of three stars in the 
Pleiades with the Fine Guidance Sensor of the Hubble Space Tele- 
scope,!° which gave a distance of 133.5 + 1.2 pc, in good agreement 
with the main sequence results. At the time of writing, the balance 
of astronomical opinion seems to be favoring the distances given by 
main sequence photometry.!” 


Red Clump Stars: The color-magnitude diagram of clusters in metal- 
rich!® parts of the galaxy reveals distinct clumps of red giant stars 
in a small region of the diagram, with a spread of only about 0.2 in 
visual magnitude. These are stars that have exhausted the hydrogen 
at their cores, with helium taking the place of hydrogen as the fuel for 
nuclear reactions at the stars’ cores. The absolute magnitude of the 
red clump stars in the infrared band (wavelengths around 800 nm) has 
been determined!? to be My = —0.28 + 0.2 mag, using the distances 
and apparent magnitudes measured with the Hipparcos satellite and in 
an earlier survey.” In this band there is little dependence of absolute 
magnitude on color, but it has been argued that even the infrared 
magnitude may depend significantly on metallicity.! 


RR Lyrae Stars: These are variable stars that have been used as 
distance indicators for many decades.” They can be recognized by 
their periods, typically 0.2 to 0.8 days. The use of the statistical par- 
allax, trigonometric parallax and moving cluster methods (with data 


Le 


As 


3B. Paczynski, Nature 227, 299 (22 January, 2004). 

4G. Meynet, J.-C. Mermilliod, and A. Maeder, Astron. Astrophys. Suppl. Ser. 98, 477 
(1993). 

5J.-C. Mermilliod, C. Turon, N. Robichon, F. Arenouo, and Y. Lebreton, in ESA SP-402 Hipparcos- 
Venice ‘97, eds. M.A.C. Perryman and P. L. Bernacca (European Space Agency, Paris, 1997), 643; F. van 
euwen and C. S. Hansen Ruiz, ibid, 689; F. van Leeuwen, Astron. Astrophys. 341, L71 (1999). 

6D. R. Soderblom et al., Astron. J. 129, 1616 (2005) [astro-ph/0412093]. 

7A new reduction of the raw Hipparcos data is given by F. van Leeuwen and E. Fantino, Astron. 
trophys. 439, 791 (2005) [astro-ph/0505432]. 

8 Astronomers use the word “metal” to refer to all elements heavier than helium. 

OB. Paczyński and K. Z. Stanek, Astrophys. J. 494, L219 (1998). 


20A, Udalski et al., Acta. Astron. 42, 253 (1992). 
2L, Girardi, M. A. T. Groenewegen, A. Weiss, and M. Salaris, astro-ph/9805127. 
22For a review, see G. Bono, Lect. Notes Phys. 635, 85 (2003) [astro-ph/0305102]. 
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from both ground-based observatories and the Hipparcos satellite) 
give respectively?’ an absolute visual magnitude for RR Lyrae stars 
in our galaxy’s halo of 0.77 + 0.13, 0.71 + 0.15, and 0.67 + 0.10, in 
good agreement with an earlier result?4 Myis = 0.71 + 0.12 for halo 
RR Lyrae stars and 0.79 + 0.30 for RR Lyrae stars in the thick disk 
of the galaxy. RR Lyrae stars are mostly too far for a measurement 
of their trigonometric parallax, but recently measurements?’ with the 
Hubble Space Telescope have given a value of 3.82 x 107? arcsec for the 
trigonometric parallax of the eponymous star RR Lyr itself, implying 
an absolute visual magnitude of 0.617%! 


Eclipsing Binaries: In favorable cases it is possible to estimate the 
intrinsic luminosity of a star that is periodically partially eclipsed by 
a smaller companion, without the use of any intermediate distance 
indicators. The velocity of the companion can be inferred from the 
Doppler shift of its spectral lines (with the ellipticity of the orbit 
inferred from the variation of the Doppler shift with time), and the 
radius of the primary star can then be calculated from the duration 
of the eclipse. The temperature of the primary can be found from 
measurement of its spectrum, typically from its apparent luminosity 
in various wavelength bands. Knowing the radius, and hence the area, 
and the temperature of the primary, its absolute luminosity can then be 
calculated from the Stefan—Boltzmann law for black body radiation. 
This method has been applied to measure distances to two neighboring 
dwarf galaxies, the Large Magellanic Cloud (LMC) and the Small 
Magellanic Cloud (SMC),7’ and to the Andromeda galaxy M3178 and 
its satellite M33.?? 


Cepheid variables: Because they are so bright, these are by far the 
most important stars used to measure distances outside our galaxy. 
Named after the first such star observed, 6 Cephei, they can be 


23p Popowski and A. Gould, Astrophys. J. 506, 259, 271 (1998); also astro-ph/9703140, astro- 


ph/9802168; and in Post-Hipparcos Cosmic Candles, eds. A. Heck and F. Caputo (Kluwer Academic 
Publisher, Dordrecht) [astro-ph/9808006]; A. Gould and P. Popowski, Astrophys. J. 568, 544 (1998) 
[astro-ph/9805176]; and references cited therein. 


24A. Layden, R. B. Hanson, S. L. Hawley, A. R. Klemola, and C. J. Hanley, Astron. J. 112, 2110 


(1996). 


25G, F, Benedict et al., Astrophys. J. 123, 473 (2001) [astro-ph/0110271] 
26E, F. Guinan et al., Astrophys. J. 509, L21 (1998); E. L. Fitzpatrick et al. Astrophys. J. 587, 685 


(2003). 


277, J. Harries, R. W. Hilditch, and I. D. Howarth, Mon. Not. Roy. Astron. Soc. 339, 157 


(2003); R. W. Hilditch, I. D. Howarth, and T. J. Harries, Mon. Not. Roy. Astron. Soc. 357, 304 
(2005). 


28T, Ribas et al., Astrophys. J. 635, L37 (2005). 
294. Z. Bonanos et al., Astrophys. Space Sci. 304, 207 (2006) [astro-ph/0606279]. 


21 


1 The Expansion of the Universe 


recognized from the characteristic time dependence of their luminos- 
ity, with periods ranging from 2 to 45 days. (Cepheids in other galaxies 
have been observed with periods extending up to 100 days.) In 1912 
Henrietta Swan Leavitt discovered that the Cepheid variables in the 
Small Magellanic Cloud (SMC) have apparent luminosities given by 
a smooth function of the period of the variation in luminosity, but 
the distance to the SMC was not known. Having measured the dist- 
ances and apparent luminosities of several Cepheids in open clusters, 
and hence their absolute luminosities, it became possible to calibrate 
the relation between period and luminosity. Cepheid variables thus 
became a “standard candle” that could be used to measure the dis- 
tance to any galaxy close enough for Cepheids to be seen. It was the 
discovery of Cepheids in M31, together with Leavitt’s calibration of 
the Cepheid period—luminosity relation, that allowed Edwin Hubble in 
1923 to measure the distance of M31, and show that it was far outside 
our own galaxy, and hence a galaxy in its own right. 


Today the form of the Cepheid period—luminosity relation is 
derived more from the Large Magellanic Cloud (LMC), where there 
are many Cepheids, and the dependence of the absolute luminosity on 
color is also taken into account. The calibration of Cepheid absolute 
luminosities can therefore be expressed as (and often in fact amounts 
to) a measurement of the distance to the LMC. Main sequence pho- 
tometry and other methods gave what for some years was a generally 
accepted LMC distance modulus of 18.5 mag, corresponding accord- 
ing to Eq. (1.3.6) to a distance of 5.0 x 104 pc. The use of red clump 
stars? has given a distance modulus of 18.47, with a random error 
+0.01, and a systematic error ane A large catalog?! of Cepheids in 
the LMC has been interpreted by the members of the Hubble Space 
Telescope Key Project?” to give the Cepheid visual and infrared abso- 
lute magnitudes as functions of the period P in days: 


My = —2.760 logig P — 1.458 , My, = —2.962 logio P — 1.942 , 
(1.3.7) 
under the assumption that the LMC distance modulus is 18.5. 
This result was challenged in two distinct ways, which illustrate the 
difficulty of this sort of distance measurement: 


First, there have been discordant measurements of the distance to 
the LMC. Under the assumption that red clump stars in the LMC 


30M. Salaris, S. Percival, and L. Girardi, Mon. Not. Roy. Astron. Soc. 345, 1030 (2003) [astro- 
ph/0307329]. 

31A, Udalski et al., Acta Astr. 49, 201 (1999): Table 1. 

32W. L. Freedman et al., Astrophys. J. 553, 47 (2001). 


22 


1.3 Distances at small redshift: The Hubble constant 


have the same infrared luminosity as those in the local galactic disk, 
a distance modulus was found?’ that was 0.45 magnitudes smaller, 
giving a distance to the LMC that is smaller by a factor 0.8. This 
has in turn been challenged on the grounds that the stars in the LMC 
have distinctly lower metallicity than in the local disk; two groups 
taking this into account** have given LMC distance moduli of 18.36 + 
0.17 mag and 18.28 + 0.18 mag, in fair agreement with the previously 
accepted value. This also agrees with the measurement of the distance 
to the LMC inferred?’ from observations of RR Lyrae stars, which 
gives a distance modulus of 18.33 + 0.06 mag. This distance modulus 
for the LMC is further confirmed by the measurement of the distance 
of the eclipsing binary HV2274; taking account of its distance from 
the center of the LMC gives*® a distance modulus for the LMC of 
18.30 + 0.07. 


Second, there have been new calibrations of the Cepheid period— 
luminosity relation, that do not rely on Cepheids in the LMC, which 
together with observations of Cepheids in the LMC can be used to 
give an independent estimate of the LMC distance.*” In recent years 
the satellite Hipparcos** has measured trigonometric parallaxes for 
223 Cepheid variables in our galaxy, of which almost 200 can be used 
to calibrate the period—luminosity relation, without relying on main 
sequence photometry, red clump stars, or RR Lyrae stars. The nearest 
Cepheids are more than 100 pc away from us (the distance to Polaris 
is about 130 pc), so the parallaxes are just a few milliarcseconds, and 
individual measurements are not very accurate, but with about 200 
Cepheids measured it has been possible to get pretty good accuracy. 
One early result?’ gave the relation between the absolute visual mag- 
nitude My and the period P (in days) as 


My = —2.81log,) P — 1.43 +0.10 . 


This was a decrease of about 0.2 magnitudes from previous results, 
i.e., an increase of the intrinsic luminosity of Cepheids by a factor 
1002x2/5 = 1.20 leading to a 10% increase in all cosmic distances based 


33K. Z. Stanek, D. Zaritsky, and J. Harris, Astrophys. J. 500, L141 (1998) [astro-ph/9803181]. 

34A. A. Cole, Astrophys. J. 500, L137 (1998) [astro-ph/9804110]; L. Girardi et al., op. cit.. 

35p Popowski and A. Gould, op. cit.. 

36E, F, Guinan et al., op. cit.. 

37For a review, see M. Feast, Odessa Astron. Publ. 14 [astro-ph/01 10360]. 

38M. A. C. Perryman, Astron. Astrophys. 323, L49 (1997). 

39M. W. Feast and R. M. Catchpole, Mon. Not. Roy. Astron. Soc. 286, L1 (1997); also see F. Pont, 
in Harmonizing Cosmic Distances in a Post-Hipparcos Era, eds. D. Egret and A. Heck (ASP Conference 
Series, San Francisco, 1998) [astro-ph/9812074]; H. Baumgardt, C. Dettbarn, B. Fuchs, J. Rockmann, 
and R. Wielen, in Harmonizing Cosmic Distance Scales in a Post-Hipparcos Era, ibid [astro-ph/9812437]. 
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directly or indirectly on the Cepheid period—luminosity relation. With 
this value of Cepheid absolute luminosity, the LMC distance mod- 
ulus would be 18.66, or slightly less with corrections for the metal- 
licity of the LMC (though with the absolute luminosity of Cepheids 
calibrated by Hipparcos observations, the only relevance of the LMC 
for the Cepheid period—luminosity relation is to determine its shape.) 
This result for the Cepheid absolute luminosities has in turn been 
contradicted.*° 


These uncertainties may now have been resolved by measurements 
of the trigonometric parallax of Cepheids in our galaxy with the Fine 
Guidance Sensor of the Hubble Space Telescope. First, the trig- 
onometric parallax of 5 Cephei*! gave a distance of 273 + 11 pc, 
corresponding to an LMC distance modulus of 18.50 + 0.13. More 
recently, trigonometric parallaxes have been measured for nine Galactic 
Cepheids, giving an LMC distance modulus of 18.50 + 0.03, or with 
metallicity corrections, 18.40 + 0.05.42 


There has also been an independent calibration of the Cepheid 
period—luminosity relation through observations? of Cepheids in the 
galaxy NGC 4258, whose distance 7.2 + 0.5 Mpc has been measured 
using the observations of proper motions of masers in this galaxy 
mentioned above. This distance is in satisfactory agreement with the 
distance 7.6 + 0.3 Mpc obtained from the Cepheids in NGC 4258 
under the assumption that these Cepheids have the period—luminosity 
relation (1.3.7) obtained under the assumption that the LMC distance 
modulus is 18.5, which tends to confirm this period—luminosity rela- 
tion. But there are differences in the metallicity of the Cepheids in 
NGC 4258 and in the LMC, which makes this conclusion somewhat 
controversial.*4 A 2006 calibration of the Cepheid period—luminosity 
relation based on the study of 281 Cepheids in NGC 4258* (whose dis- 
tance, as we have seen, is known from observations of maser Doppler 
shifts and proper motions) gave an LMC distance modulus 18.41 + 
0.10 (stat.) +0.13 (syst.). This study includes both a field that is metal 
rich, like our Galaxy, and a field that is metal poor, like the LMC, so 


40 See, e.g., B. F. Madore and W. L. Freedman, Astrophys. J. 492, 110 (1998) For a recent survey of 
the theory underlying the Cepheid period—luminosity relation, see A. Gautschy, in Recent Results on 
Ho—19th Texas Symposium on Relativistic Astrophysics [astro-ph/9901021). 

41G. F. Benedict et al., Astrophys. J. 124, 1695 (2002). 

42G. F Benedict et al., Astron. J. 133, 1810 (2007), [astro-ph/0612465]. 

43]. A. Newman et al., Astrophys. J. 553, 562 (2001) [astro-ph-0012377]. 

44For instance, see B. Paczynski, Nature 401, 331 (1999); F. Caputo, M. Marconi, and I. Musella, 
Astrophys. J. [astro-ph/0110526]. 

45L, M. Macri et al., Astrophys. J. 652, 1133 (2006) [astro-ph/060821 1]. 


24 


1.3 Distances at small redshift: The Hubble constant 


it provides a calibration of the metallicity dependence of the Cepheid 
period—luminosity relation. 


In a 2003 survey*® the LMC distance modulus measured using a 
variety of distance indicators other than Cepheid variables (including 
RR Lyrae stars, red clump stars, etc.) was found to be 18.48 + 0.04, 
in very good agreement with the earlier value 18.52 + 0.05 found by 
observation of Cepheids, with the corrections adopted by the Hubble 
Space Telescope group. 


B. Secondary distance indicators 


None of the above distance indicators are bright enough to be used to 
measure distances at redshifts large enough so that peculiar velocities can 
be neglected compared with the expansion velocity, say, z > 0.03. For this 
we need what are called secondary distance indicators that are brighter than 
Cepheids, such as whole galaxies, or supernovae, which can be as bright as 
whole galaxies. 

For many years Cepheids could be used as distance indicators only out 
to a few million parsecs (Mpc), which limited their use to the Local Group 
(which consists of our galaxy and the Andromeda nebula M31, and a dozen 
or so smaller galaxies like M33 and the LMC and SMC) and some other 
nearby groups (the M81, M101, and Sculptor groups). This was not enough 
to calibrate distances to an adequate population of galaxies or supernovae, 
and so it was necessary to use a variety of intermediate distance indicators: 
globular clusters, HII regions, brightest stars in galaxies, etc. Now the 
Hubble Space Telescope allows us to observe Cepheids in a great many 
galaxies at much greater distances, out to about 30 Mpc, and so the sec- 
ondary distance indicators can now be calibrated directly, without the use 
of intermediate distance indicators. Four chief secondary distance indica- 
tors have been developed: 


1. The Tully—Fisher relation 

Although whole galaxies can be seen out to very large distances, it has 
not been possible to identify any class of galaxies with the same absolute 
luminosity. However, in 1977 Tully and Fisher“ developed a method for 
estimating the absolute luminosity of suitable spiral galaxies. The 21 cm 
absorption line in these galaxies (arising in transitions of hydrogen atoms 
from lower to the higher of their two hyperfine states) is widened by the 


46M. Feast, Lect. Notes Phys. 635, 45 (2003) [astro-ph/0301 100]. 
“R.B. Tully and J. R. Fisher, Astron. Astrophys. 54, 661 (1977) 
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Doppler effect, caused by the rotation of the galaxy. The line width W 
gives an indication of the maximum speed of rotation of the galaxy, which 
is correlated with the mass of the galaxy, which in turn is correlated with the 
galaxy’s absolute luminosity.“ (It is also possible to apply the Tully—Fisher 
relation using the width of other lines, such as a radio frequency transition 
in the carbon monoxide molecule.*”) 

In one application of this approach” the shape of the function L;(W) 
that gives the infrared band absolute luminosity as a function of 21 cm line 
width (that is, the absolute luminosity up to a common constant factor) was 
found from a sample of 555 spiral galaxies in 24 clusters, many with redshifts 
less than 0.01. (The relative distances to these galaxies were found from the 
ratios of their redshifts, using Eq. (1.2.9), so that the peculiar velocities of 
these galaxies introduced considerable errors into the estimated ratios of 
absolute luminosities of individual pairs of galaxies, but with 555 galaxies 
in the sample it could be assumed that these errors would cancel in a least- 
squares fit of the measured relative values of absolute luminosity to a smooth 
curve.) Roughly speaking, L;(W) turned out to be proportional to W?. 
The overall scale of the function L;(W) was then found by fitting it to the 
absolute luminosities of 15 spiral galaxies whose distances were accurately 
known from observations of Cepheid variables they contain. (These 15 
galaxies extended out only to 25 Mpc, not far enough for them to be used 
to measure the Hubble constant directly.) The Hubble constant could then 
be found by using the function L;(W) calibrated in this way to find the 
distances to galaxies in 14 clusters with redshifts ranging from 0.013 to 
0.03, and comparing the results obtained with Eq. (1.2.8). (These redshifts 
may not be large enough to ignore peculiar velocities altogether, but again, 
this problem is mitigated by the use of a fairly large number of galaxies.) 
The Hubble constant found in this way was 70 + 5 km s7! Mpc™!. More 
recently, the Hubble Space Telescope Key Project to Measure the Hubble 
Constant has used Cepheid variables to recalibrate the Tully—Fisher relation 
(assuming an LMC distance of 50 kpc) and then found Hp by plotting 
distances found from the Tully—Fisher relation against redshift for a sample 
of 19 clusters with redshifts from 0.007 to 0.03,>! taken from the G97 survey 
of Giovanelli et al.>2 The result was Hp = 71 + 3 +7 km s7! Mpc™!, with 
the first quoted uncertainty statistical and the second systematic. 


48M. Aaronson, J. R. Mould, and J. Huchra, Astrophys. J. 229, 1 (1979). 

4Y. Tutui et al., Publ. Astron. Soc. Japan 53, 701 (2001) [astro-ph/0108462]. 

SOR, Giovanelli, in The Extragalactic Distance Scale - Proceedings of the Space Telescope Science 
Institute Symposium held in Baltimore, MD, May, 1996 (Cambridge University Press, Cambridge, 1997): 
113; R. Giovanelli et al., Astron. J. 113, 22 (1997). 

51S, Sakai et al., Astrophys. J. 529, 698 (2000); W. L. Freedman et al., Astrophys. J. 553, 47 (2001); 
and references cited therein. 

52 Giovanelli et al., op. cit. 
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2. Faber—Jackson relation 

Just as the Tully—Fisher method is based on a correlation of orbital velocities 
with absolute luminosities in spiral galaxies, the Faber-Jackson method is 
based on a correlation of random velocities with absolute luminosities in 
elliptical galaxies.® An advantage of this method over the Tully—Fisher 
method is that it has a firmer theoretical foundation, provided by the virial 
theorem to be discussed in Section 1.9, which directly relates the mean 
square random velocity to the galaxy mass. 


3. Fundamental plane 

The Faber—Jackson method was improved by the recognition that the cor- 
relation between orbital velocity and absolute luminosity depends also on 
the surface brightness of the cluster, and hence on its area.’ (The term 
“fundamental plane” refers to the way that data on elliptical galaxies are 
displayed graphically.) This method has been used>> to estimate that Ho = 
78 + 5 (stat.) +9 (syst.) km sec~! Mpc7!. 


4. Type Ia supernovae 

Supernovae of Type Ia are believed to occur when a white dwarf star in a 
binary system accretes sufficient matter from its partner to push its mass 
close to the Chandrasekhar limit, the maximum possible mass that can be 
supported by electron degeneracy pressure.°° When this happens the white 
dwarf becomes unstable, and the increase in temperature and density allows 
the conversion of carbon and oxygen into *°Ni, triggering a thermonuclear 
explosion that can be seen at distances of several thousand megaparsecs. 
The exploding star always has a mass close to the Chandrasekhar limit, so 
there is little variation in the absolute luminosity of these explosions, mak- 
ing them nearly ideal distance indicators.°’ What variation there is seems 


535S, M. Faber and R. E. Jackson, Astrophys. J. 204, 668 (1976). 

54A, Dressler et al., Astrophys. J. 313, 42 (1987). 

55D, D. Kelson et al., Astrophys. J. 529, 768 (2000) [astro-ph/9909222]; J. P. Blakeslee, J. R. Lucey, 
J. L. Tonry, M. J. Hudson, V. K. Nararyan, and B. J. Barris, Mon. Not. Roy. Astron. Soc. 330, 443 
(2002) [astro-ph/011183]. 

56W. A. Fowler and F. Hoyle, Astrophys. J. 132, 565 (1960). Calling a supernova Type I simply means 
that hydrogen lines are not observed in its spectrum. In addition to Type Ia supernovae, there are other 
Type I supernovae that occur in the collapse of the cores of stars much more massive than white dwarfs, 
whose outer layer of hydrogen has been lost in stellar winds, as well as Type II supernovae, produced by 
core collapse in massive stars that have not lost their outer layer of hydrogen. For a discussion of the 
Chandrasekhar limit, see G&C, Section 11.3. 

57The use of Type Ia supernovae as distance indicators was pioneered by A. Sandage and G. A. 
Tammann, Astrophys. J. 256, 339 (1982), following an earlier observation that they had fairly uniform 
luminosity by C. T. Kowal, Astron. J. 73, 1021 (1968). In 1982 it was necessary to use brightest 
supergiant stars as intermediate distance indicators, to bridge the gap between the distances that could 
then be measured using Cepheids and the distances at which the Type Ia supernova could be found. 
For reviews of the use of type Ia supernovae as standard candles, see D. Branch, Ann. Rev. Astron. & 
Astrophys. 36, 17 (1998); P. H6flich, C. Gerardy, E. Linder, and H. Marion, in Stellar Candles, eds. W. 
Gieren et al. (Lecture Notes in Physics) [astro-ph/0301334]. 
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to be well correlated with the rise time and decline time of the supernova 
light: the slower the decline, the higher the absolute luminosity.°° 

This relation has been calibrated by measurements of Type Ia super- 
novae in several galaxies of known distance. From 1937 to 1999 there were 
ten supernovae in galaxies whose distance had been measured by observa- 
tion of Cepheid variables they contain.’ Of these, six Type Ia supernovae 
were used by the HST Key Ho Group™ to calibrate the relation between 
absolute luminosity and decline time. This relation was then used to calcu- 
late distances to a sample of 29 Type Ia supernovae in galaxies with redshifts 
extending from 0.01 to 0.1, observed at the Cerro Tololo Inter-American 
Observatory.®! Plotting these distances against measured redshifts gave a 
Hubble constant? of 71 + 2(statistical) + 6(systematic) km s~! Mpc~!. 
This agrees well with an older determination using Type Ia supernovae by 
a Harvard group,°* which found Hy = 67 + 7 km s~! Mpc~!. Members 
of this group have superceded this result,°* now giving a Hubble constant 
Ho = 73 £ A(stat.) + 5(syst.) km s7! Mpc~!. On the other hand, a group 
headed by Sandage using Type Ia supernovae and the Tully—Fisher relation 
has consistently found lower values of Ho.®> The gap seems to be narrow- 
ing; in 2006, this group quoted®™ a value Ho = 62.3 + 1.3(stat.) + 5.0(syst.) 
km s~! Mpc~!. (According to Sandage et al., the difference between these 
results is due to a difference in the Cepheid period—luminosity relation used 
to measure distances to the galaxies that host the supernovae that are used 
to calibrate the relation between supernova absolute luminosity and decline 
time. Sandage et al. use a metallicity-dependent period—luminosity rela- 
tion. However Macri et al.“ subsequently reported no difference in the 
period-luminosity relation for Cepheids in a metal-rich and a metal-poor 
region of NGC 4258.) 

It is an old hope that with a sufficient theoretical understanding of 
supernova explosions, it might be possible to measure their distance 


58M. Phillips, Astrophys. J. 413, L105 (1993); M. Hamuy et al., Astron. J. 109, 1 (1995); A. Reiss, 
W. Press, and R. Kirshner, Astrophys. J. 438, L17 (1996); S. Jha, A. Riess, & R. P. Kirshner, Astrophys. 
J. 659, 122 (20007). A dependence of absolute luminosity on color as well as decline time has been 
considered by R. Tripp and D. Branch, Astrophys. J. [astro-ph/9904347]. 

59 For a list, see Tripp and Branch, op. cit.. 

60B, Gibson et al., Astrophys. J. 529, 723 (2000) [astro-ph/9908149]. 

61M. Hamuy et al., Astron. J. 112, 2398 (1996). 

62L, Ferrarese et al., in Proceedings of the Cosmic Flows Workshop, eds. S. Courteau et al. (ASP 
Conference Series) [astro-ph/9909134]; W. L. Freedman et al., Astrophys. J. 553, 47 (2001). 

63A. G. Riess, W. H. Press, And R. P. Kirshner, Astrophys. J. 438, L17 (1995) 

64A, Riess et al., Astrophys. J. 627, 579 (2005) [astro-ph/0503159]. 

For a 1996 summary, see G. A. Tammann and M. Federspeil, in The Extragalactic Distance Scale, 
eds. M. Livio, M. Donahue, and N. Panagia (Cambridge University Press, 1997): 137. 

66A. Sandage et al., Astrophys. J. 653, 843 (2006) [astro-ph/0603647]. 
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without use of primary distance indicators. A 2003 comparison®’ of observed 
light curves (apparent magnitude as a function of time) and spectra with 
theory for 26 Type Ia supernovae with redshifts extending up to 0.05, plus 
one with redshift 0.38, gave Hp = 67 km sec-!Mpc~!, with a two standard 
deviation uncertainty of 8 km sec7!Mpc7!. It is too soon for this method 
to replace the older method based on the use of primary distance indicators 
to calibrate the supernova absolute luminosities, but the agreement between 
the values of Ho found in these two ways provides some reassurance that no 
large error is being made with the older method. 

It is instructive to consider a fifth secondary distance indicator that is 
also used to measure the Hubble constant: 


5. Surface brightness fluctuations 

In 1988 Tonry and Schneider®* suggested using the fluctuations in the 
observed surface brightness of a galaxy from one part of the image to 
another as a measure of the galaxy’s distance. Suppose that the stars in 
a galaxy can be classified in luminosity classes, all the stars in a luminosity 
class į having the same absolute luminosity L;. The rate of receiving energy 
per unit area of telescope aperture in a small part of the galactic image (as 
for instance, a single pixel in a charge-coupled device) is 


NiLi 
l= X p (1.3.8) 


where N; the number of stars of class i in this part of the galaxy’s image, 
and d is the distance of the galaxy. Usually only the brightest stars can 
be resolved, so it is not possible to measure all the N; directly, but one 
can measure the fluctuations in £ from one part of the image to another 
due to the finite values of the N;. Suppose that the different N; fluctuate 
independently from one small part of the galaxy’s image to another, and 
obey the rules of Poisson statistics, so that 


(Ni = (N) N; — (N) = by (Ni) >» (1.3.9) 


with brackets denoting an average over small parts of the central portion of 
the galaxy’s image. It follows then that 
C _ iL 
(£) And?’ 


(1.3.10) 


67p HO6flich, C. Gerardy, E. Linder, and H. Marion, op. cit. 
68]. Tonry and D. P. Schneider, Astron. J. 96, 807 (1988). 
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where L is a luminosity-weighted mean stellar luminosity 


ENDL? 
DN) Li 


which is expected to vary much less from one galaxy to another than the 
luminosities of the galaxies themselves. Eq. (1.3.10) can be used to mea- 
sure distances once this relation is calibrated by measuring L. By studying 
surface brightness fluctuations in a survey of galaxies whose distances were 
found by observations of Cepheids they contain, Tonry et al. found an 
absolute magnitude Mz that in the infrared band is equivalent to the abso- 
lute luminosity L: 


L= (1.3.11) 


My = (-1.74 £0.07) + (4.5 £0.25) [my — my — 1.15] (1.3.12) 


where my — my is a parameter characterizing the color of the galaxy, equal 
to the difference of its apparent magnitudes in the infrared and visual 
bands, assumed here to lie between 1.0 and 1.5. Using Eq. (1.3.10) to find 
distances of galaxies of higher redshift, they obtained a Hubble constant 
81+6kms~—! Mpc™!. 

There are other phenomena that are used to measure the Hubble 
constant, including the comparison of apparent and absolute luminosity 
of supernovae of other types, novae, globular clusters, and planetary neb- 
ulae, the diameter—velocity dispersion relation for elliptical galaxies, grav- 
itational lenses (discussed in Section 1.12), the Sunyaev—Zel’dovich effect 
(discussed in Section 2.3), etc.’”? The HST Key Ho Group have put together 
their results of measurements of the Hubble constant using the Tully—Fisher 
relation, Type Ia supernovae, and several of these other secondary distance 
indicators, and conclude that”! 


Ho = 71 +6 km s7! Mpc™! ; 


As we will see in Section 7.2, the study of anisotropies in the cosmic 
microwave background has given a value Hy) = 7343 km s7! Mpc™!. This 
does not depend on any of the tools discussed in this section, but it does 
depend on some far-reaching cosmological assumptions: including flat spa- 
tial geometry, time-independent vacuum energy, and cold dark matter. For 
this reason, the increasingly precise measurement of Ho provided by the 


6J, L, Tonry, J. P. Blakeslee, E. A. Ajhar, and A. Dressler, Astrophys. J. 473, 399 (1997). For a more 
recent survey, see J. L. Tonry et al., Astrophys. J. 546, 681 (2001) [astro-ph/001 1223]. 

For a survey of most of these methods, with references, see G. H. Jacoby, D. Branch, R. Ciardullo, 
R. L. Davies, W. E. Harris, M. J. Pierce, C. J. Pritchet, J. L. Tonry, and D. L. Welch, Publ. Astron. Soc. 
Pacific 104, 599 (1992). 

TIL, Ferrarese et al., op. cit., W. L. Freedman et al., Astrophys. J. 553, 47 (2001). 


30 


1.4 Luminosity distances and angular diameter distances 


cosmic microwave background will not supplant the older measurements 
discussed in this section — rather, the agreement (or possible future disagree- 
ment) between the values of Ho provided by these very different methods 
will serve to validate (or possibly invalidate) the cosmological assumptions 
made in the analysis of the cosmic microwave background. 

To take account of the remaining uncertainty in the Hubble constant, it 
is usual these days to take 


Ho = 100h km s™! Mpc™! , (1.3.13) 


with the dimensionless parameter h assumed to be in the neighborhood of 
0.7. This corresponds to a Hubble time 


1/Ho = 9.778 x 10? A! years . (1.3.14) 


1.4 Luminosity distances and angular diameter distances 


We must now consider the measurement of distances at large redshifts, say 
z > 0.1, where the effects of cosmological expansion on the determination 
of distance can no longer be neglected. It is these measurements that can 
tell us whether the expansion of the universe is accelerating or decelerating, 
and how fast. Before we can interpret these measurements, we will need 
to consider in this section how to define distance at large redshifts, and 
we will have to apply Einstein’s field equations to the Robertson—Walker 
metric in the following section. After that, we will return in Section 1.6 to 
the measurements of distances for large redshift, and their interpretation. 
In the previous section we derived the familiar relation £ = L/4sd? for 
the apparent luminosity £ of a source of absolute luminosity L at a distance 
d. At large distances this derivation needs modification for three reasons: 


1. At the time fo that the light reaches earth, the proper area of a sphere 
drawn around the luminous object and passing through the earth is 
given by the metric (1.1.10) as 4rr?a (to), where rı is the coordinate 
distance of the earth as seen from the luminous object, which is just the 
same as the coordinate distance of the luminous object as seen from 
the earth. The fraction of the light received in a telescope of aperture 
A on earth is therefore A /4rr?a? (to), and so the factor 1/d? in the 
formula for £ must be replaced with 1/r?a? (to). 


2. The rate of arrival of individual photons is lower than the rate at which 
they are emitted by the redshift factor a(t,)/a(to) = 1/(1 + 2). 


3. The energy hvo of the individual photons received on earth is less than 
the energy Av, with which they were emitted by the same redshift factor 
1/d +2). 
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Putting this together gives the correct formula for apparent luminosity of a 
source at radial coordinate rı with a redshift z of any size: 


L 
l= . 
47r?a? (to) (1 + 2)? 


(1.4.1) 


It is convenient to introduce a “luminosity distance” dz, which is defined so 
that the relation between apparent and absolute luminosity and luminosity 
distance is the same as Eq. (1.3.3): 


L 
= eal (1.4.2) 
And; 
Eq. (1.4.1) can then be expressed as 
dr = a(to)r1 +2). (1.4.3) 


For objects with z « 1, we can usefully write the relation between 
luminosity distance and redshift as a power series. The redshift 1 + z = 
a(to)/a(t,) is related to the “look-back time” to — tı by 


l 
z = Holt — t1) + 5 (40 + 2) HB (to — t1)? +... (1.4.4) 


where Ho is the Hubble constant (1.2.7) and go is the deceleration 
parameter 


= —l1 dat) 
— Halto) at? 


qo (1.4.5) 


t=to 


This can be inverted, to give the look-back time as a power series in the 
redshift 


1 
Hoto- t1) =z- 5go +22 +... . (1.4.6) 
The coordinate distance rı of the luminous object is given by Eq. (1.2.2) as 


to— t | Ho(to — t)? 
a(to) 2a(to) 


with the dots on the right-hand side denoting terms of third and higher order 
in rı. Using Eq. (1.4.6), the solution is 


peace E (1.4.7) 


1 
rı a(to) Ho = z — 50 + qo)? +. (1.4.8) 
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This gives the luminosity distance (1.4.3) as a power series 
-1 l 2 
dr = H; [z+ 50-402 +l. (1.4.9) 
We can therefore measure go as well as Hp by measuring the luminosity 


distance as a function of redshift to terms of order z?. The same reasoning 
has been used to extend the expression (1.4.9) to fourth order in z:! 


E 1 1 K 
di(z)=H,' z + zl — q0)2? — 6 (: — qo — 346 + jo + a z? 
070 


1 
+e — 2q0 — 1545 — 1548 + 5jo + 10gajo 


2K(1 +3 
Hag 


where jo and so are parameters known as the jerk and snap: 


1 dta(t) 
s0 = z 7 
Ho a(to) dt 


o l dave) 
— Hpa(to) dP 


Jo 


t=10 t=10 


Years ago cosmology was called “a search for two numbers,” Ho and qo. 
The determination of Hp is still a major goal of astronomy, as discussed in 
the previous section. On the other hand, there is less interest now in qo. 
Instead of high-precision distance determinations at moderate redshifts, of 
order 0.1 to 0.2, which would give an accurate value of go, we now have 
distance determinations of only moderate precision at high redshifts, of 
order unity, which depend on the whole form of the function a(t) over the 
past few billion years. For redshifts of order unity, it is not very useful to 
expand in powers of redshift. In order to interpret these measurements, we 
will need a dynamical theory of the expansion, to be developed in the next 
section. As we will see there, modern observations suggest strongly that 
there are not two but at least three parameters that need to be measured to 
calculate a(t). 

Before turning to this dynamical theory, let’s pause a moment to clar- 
ify the distinction between different measures of distance. So far, we have 
encountered the proper distance (1.1.15) and the luminosity distance (1.4.3). 
There is another sort of distance, which is what we measure when we 
compare angular sizes with physical dimensions. Inspection of the metric 


1M. Visser, Class. Quant. Grav. 21, 2603 (2004) [gr-qc/0309109]. The term of third order in z was 
previously calculated by T. Chiba and T. Nakamura, Prog. Theor. Phys. 100, 1077 (1998). 
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(1.1.12) shows that a source at co-moving radial coordinate rı that emits light 
at time ¢; and is observed at present to subtend a small angle 6 will extend 
over a proper distance s (normal to the line of sight) equal to a(t,)r,6. The 
angular diameter distance d4 is defined so that 6 is given by the usual relation 
of Euclidean geometry 


G=a/d4 (1.4.10) 
and we see that 
d4 = a(ti)ri . (1.4.11) 


Comparison of this result with Eq. (1.4.3) shows that the ratio of the 
luminosity and angular-diameter distances is simply a function of redshift: 


da/d, =(1+z)7?. (1.4.12) 


Therefore if we have measured the luminosity distance at a given redshift 
(and if we are convinced of the correctness of the Robertson—Walker met- 
ric), then we learn nothing additional about a(t) if we also measure the 
angular diameter distance at that redshift. Neither galaxies nor supernovas 
have well-defined edges, so angular diameter distances are much less use- 
ful in studying the cosmological expansion than are luminosity distances. 
However, as we shall see, they play an important role in the theoretical anal- 
ysis of both gravitational lenses in Chapter 9 and of the fluctuations in the 
cosmic microwave radiation background in Chapters 2 and 7. We will see 
in Section 8.1 that the observation of acoustic oscillations in the matter 
density may allow a measurement of yet another distance, a structure 
distance, equal to a(to)r) = (1 + z)d4. 


1.5 Dynamics of expansion 


All our results up to now have been very general, not depending on 
assumptions about the dynamics of the cosmological expansion. To go fur- 
ther we will need now to apply the gravitational field equations of Einstein, 
with various tentative assumptions about the cosmic energy density and 
pressure. 

The expansion of the universe is governed by the Einstein field equations 
(B.71), which can be put in the convenient form 


Ruv = —87GS yp , (1.5.1) 
where Rw is the Ricci tensor: 


ar} ar 
ww = oe ge TP lon Tat s (1.5.2) 
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and S» is given in terms of the energy-momentum tensor T,» by 
Suv = Tyv dg i (1.5.3) 


As we saw in Section 1.1, for the Robertson—Walker metric the components 
of the affine connection with two or three time indices all vanish, so 


ark. Ərk ər? 
Ry = | yyy 


ax/ axk Ot 


0 rk 
+ [PRM + rý ry + rh r] 


- [rira + rori] (1.5.4) 
7 re 
Roo = a + ri Th; (1.5.5) 


We don’t need to calculate Rip = Ro;, because it is a three-vector, and 
therefore must vanish due to the isotropy of the Robertson—Walker metric. 
Using the formulas (1.1.17)-(1.1.19) for the non-vanishing components of 
the affine connection gives 


r? 

n =fy5 (ad) , ry ip = ğå, T? TS 38a" , 

or d (a i 

ae (<) , r= JE ) (1.5.6) 


where dots denote time derivatives. Using this in Eqs. (1.5.4) and (1.5.5), 
we find that the non-vanishing components of the Ricci tensor are 


Ry = Ry — 20° — aay , (1.5.7) 
d få SND 6's 
Rysa- (<) 43 È) oas (1.5.8) 
dt \a a a 
where Ry is the purely spatial Ricci tensor 
k k 
> ory ar; 


According to Eq. (1.1.19), the spatial components - of the four-dimen- 
sional affine connection are here the same as those of the affine connection 
that would be calculated in three dimensions from the three-metric gj: 


= Kx*;, . (1.5.10) 
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To calculate Ry, we use a trick used earlier in calculating particle traj- 
ectories: we calculate Ry where the calculation is simplest, at x = 0, and 
express the result as a relation that is invariant under all transformations of 
the spatial coordinates, so that the homogeneity of the three-dimensional 
metric insures that this relation is valid everywhere. The spatial Ricci tensor 
at x = Qis 
iy - ary E ari, 
axis ax! 


= Kdy — 3K5y = —2K3, . (1.5.11) 


At x = 0 the spatial metric gj is just 5, so this can be rewritten as 


Ry = —2Kğjj , (1.5.12) 


which, since it is an equality between two three-tensors, is then true in all 
spatial coordinate systems, including systems in which the point x = 0 is 
transformed into any other point. Hence Eq. (1.5.12) is true everywhere, 
and together with Eq. (1.5.7) gives 


Ry = — [2K + 2%? + aii] žy . (1.5.13) 


We also need the values of Sj; and Soo. For this, we use Eq. (1.1.31) in 
the form 


To =p, To=0, Ty =apey, (1.5.14) 


where p(t) and p(t) are the proper energy density and pressure. Eq. (1.5.3) 
gives then 


Te: m | eee 1 a 
Sij = Tj- 5a (T; + T”) = a’ pkj- 50 Bij BP—P) = 5 (0 —p) a? ĝi, 
(1.5.15) 


1 1 1 
Soo = Too + 5 (Te + T°) = p+ =p) = zP + 3p) » 


2 
(1.5.16) 
and Sj9 = 0. The Einstein equations are therefore 
2K 2 ü 
-^5 -5 = AnG(p - p), (1.5.17) 
a a a 
34 
— = —4rGBÂp + p). (1.5.18) 
a 
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We can eliminate the second derivative terms by adding three times the 
first equation to the second, and find 
8G pa? 
ey gener 
3 
This is the fundamental Friedmann equation! governing the expansion of 
the universe. 


The remaining information in Eqs. (1.5.17) and (1.5.18) just reproduces 
the conservation law (1.1.32): 


(1.5.19) 


A 
p=-—(p +p). (1.5.20) 


(This should come as no surprise. Under all circumstances, the energy- 
momentum conservation law may be derived as a consequence of the Ein- 
stein field equations.) Given p as a function of p, we can solve Eq. (1.5.20) 
to find p as a function of a, and then use this in Eq. (1.5.19) to find a as a 
function of t. 

There is another way of deriving Eq. (1.5.19), at least for the case of non- 
relativistic matter. Imagine a co-moving ball cut out from the expanding 
universe, with some typical galaxy at its center, and suppose it then emptied 
of the matter it contains. According to Birkhoff’s theorem,” in any system 
that is spherically symmetric around some point, the metric in an empty ball 
centered on this point must be that of flat space. This holds whatever is hap- 
pening outside the empty ball, as long as it is spherically symmetric. Now 
imagine putting the matter back in the ball, with a velocity proportional to 
distance from the center of symmetry, taken as X = 0: 


X=A()xX. (1.5.21) 


(Here the components X’ of X are ordinary Cartesian coordinates, not the 
co-moving coordinates x! used in the Robertson—Walker metric. Note that 
this is the one pattern of velocities consistent with the principle of homog- 
eneity: The velocity of a co-moving particle at X; relative to a co-moving 
particle at X is X1 — X) = H(t)(X; — X2).) The solution of Eq. (1.5.21) is 


X(t) = ( a) ) X(t), (1.5.22) 
a(to) 


where a(t) is the solution of the equation 


a(t)/a(t) = H(t). (1.5.23) 


1A. Friedmann, Z. Phys. 16, 377 (1922); ibid 21, 326 (1924). 
2G&C, Section 11.7. 
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As long as the radius of the ball is chosen to be not too large, the expansion 
velocity (1.5.21) of the matter we put into it will be non-relativistic, and 
the gravitational field will be weak, so that we can follow its motion using 
Newtonian mechanics. The kinetic energy of a co-moving particle of mass 
mat X is 
1 g3 ma’x? 
K.E. = yim x a 


(1.5.24) 
The mass interior to the position of the particle is M(X) = 4srp|X|?/3, so 
the potential energy of the particle is 


GmM(X)__ 4xGmp |X| 


P.E. = 
|X| 3 


(1.5.25) 
The condition of energy conservation thus tells us that 


E = K.E. + P.E. = 


X(t)? | a? 40 Gp a? 
ei É a Ae | = constant . (1.5.26) 
a“ (to 


This is the same as Eq. (1.5.19), providing we identify the particle energy as 


K m|X(to)|? 


Fea 
2 a*(to) 


(1.5.27) 
Particles will be able to escape to infinity if and only if E > 0, which requires 
K = Q or K = —1. For K = +1 they have less than escape velocity, so the 
expansion eventually stops, and particles fall back toward each other. 

Returning now to the relativistic formalism and an arbitrary dependence 
of p on a, even without knowing this dependence we can use Eq. (1.5.19) to 
draw important consequences about the general features of the expansion. 
First, as long as p remains positive, it is only possible for the expansion of 
the universe to stop if K = +1, the case of spherical geometry. Also, for 
any value of the Hubble constant Ho = d(to)/a(to), we may define a critical 
present density 


3H2 
Pocrit = = = 1.878 x 107? h? g/cm? , (1.5.28) 
8r G 


where h is the Hubble constant in units of 100 km s~! Mpe~!. According to 
Eq. (1.5.19), whatever we assume about the constituents of the universe, the 
curvature constant K will be +1 or 0 or —1 according to whether the present 
density po is greater than, equal to, or less than 9 cri. Zf the quantity 3p + p 
is positive (as it is for any mixture of matter and radiation, in the absence 
of a vacuum energy density) then Eq. (1.5.18) shows that ä/a < 0, so the 
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expansion must have started with a = 0 at some moment in the past; the 
present age of the universe fo is /ess than the Hubble time 


to < Hy’. (1.5.29) 


Also, if K = +1 and the expansion stops, then with å/a < 0 the universe 
will again contract to a singularity at which a = 0. 
We can use Eq. (1.5.18) to give a general formula for the deceleration 
parameter go = —G(to)a(to)/a* (to): 
_ 4x G(po + 3p0) _ po + 3p0 


0 = , (1.5.30) 
3HE 2P0,crit 


with a subscript 0 denoting a present value. If the present density of the 
universe were dominated by non-relativistic matter then po < po, and the 


curvature constant K would be +1 or 0 or —1 according to whether qo > 5 


or go = 5 or qo < A On the other hand, if the present density of the universe 


were dominated by relativistic matter then pọ = pọ/3, and the critical value 
of the deceleration parameter at which K = 0 would be qọ = 1. Finally, if 
the present density of the universe were dominated by vacuum energy then 
Po = —po, and the value of the deceleration parameter at which K = 0 
would be go = —1. 

There is a peculiar aspect to these results. The contribution of non- 
relativistic and relativistic matter to the quantity pa? in Eq. (1.5.19) grows 
as a7! and a~?, respectively, as a — 0, so at sufficiently early times in the 
expansion we may certainly neglect the constant K, and Eq. (1.5.19) gives 

a 8G 

— > 

a? 3 
That is, at these early times the density becomes essentially equal to 
the critical density 3H7/82G, where H = å/a is the value of the Hubble 
“constant” at those times. On the other hand, we will see later that the total 
energy density of the present universe is still a fair fraction of the critical 
density. How is it that after billions of years, p is still not very different from 
Pcrit? This is sometimes called the flatness problem. 

The simplest solution to the flatness problem is just that we are in a 
spatially flat universe, in which K = 0 and p is always precisely equal to perit- 
A more popular solution is provided by the inflationary theories discussed 
in Chapter 4. In these theories K may not vanish, and p may not start out 
close to perit, but there is an early period of rapid growth in which p/ perit 
rapidly approaches unity. In inflationary theories it is expected though not 
required that p should now be very close to perit, in which case it is a good 
approximation to take K = 0. 


(1.5.31) 
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For K = Owe get very simple solutions to Eq. (1.5.19) in the three special 
cases listed in Section 1.1: 


Non-relativistic matter: Here op = po(a/ao)~>, and the solution of 
Eq. (1.5.19) with K = 0 is 


alt) x ft? , (1.5.32) 


This gives qọ = —aä/å? = 1/2, and a simple relation between the age of the 
universe and the Hubble constant 
2 

ea 6.52 x 10° hl yr. (1.5.33) 
Eqs. (1.5.32) and (1.5.18) show that for K = 0, the energy density at time t 
is p = 1/61Gt*. This is known as the Einstein—de Sitter model. It was for 
many years the most popular cosmological model, though as we shall see, 
the age (1.5.33) is uncomfortably short compared with the ages of certain 
stars. 


Relativistic matter: Here o = po(a/ao)~*, and the solution of Eq. (1.5.19) 
with K = 0 is 
a(t) x Vt. (1.5.34) 


This gives qọ = +1, while the age of the universe and the Hubble constant 
are related by 


ipa, (1.5.35) 


The energy density at time t is p = 3/327 G£. 


Vacuum energy: Lorentz invariance requires that in locally inertial 
coordinate systems the energy-momentum tensor T i of the vacuum must 
be proportional to the Minkowski metric 7” (for which nË = nj = 6, n° = 
nio = n°! = noi = 0, n° = noo = —1), and so in general coordinate systems 
T ois must be proportional to g“”. Comparing this with Eq. (B.43) shows that 
the vacuum has py = —py, so that TE” = —py g"”. In the absence of any 
other form of energy this would satisfy the conservation law 0 = T Dr: E= 
g””ðpy/Əəx”, so that py would be a constant, independent of spacetime 
position. Eq. (1.5.19) for K = 0 requires that py > 0, and has the solutions 


a(t) x exp(Ht) (1.5.36) 


where H is the Hubble constant, now really a constant, given by 


H= JEO. (1.5.37) 
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Here go = —1, and the age of the universe in this case is infinite. This 
is known as the de Sitter model? Of course, there is some matter in the 
universe, so even if the energy density of the universe is now dominated 
by a constant vacuum energy, there was a time in the past when matter 
and/or radiation were more important, and so the expansion has a finite 
age, although greater than it would be without a vacuum energy. 


More generally, for arbitrary K and a mixture of vacuum energy and 
relativistic and non-relativistic matter, making up fractions Qa, Qm, and 
Qpr of the critical energy density,* we have 


J5 seit E +2m (2y EOR (2)'] (1.5.38) 


where the present energy densities in the vacuum, non-relativistic matter, 
and and relativistic matter (i.e., radiation) are, respectively, 


3Ha ‘  3HAQy 3HQ 
ya 


R 
Bg NEM. LS 
mG ? ™ Rae PROS pg FRO 


Pvo= 


and, according to Eq. (1.5.19), 


K 
ay Ho 
Using this in Eq. (1.5.19) gives 
d. 
dt = E 
Hox/ 2a + NQgx-? + Qyx3 + Qrx-4 
—d. 
, (1.5.41) 


HAHA +9 +2 + Qu +z RA +274 


where x = a/aọ = 1/(1 + z). Therefore, if we define the zero of time 
as corresponding to an infinite redshift, then the time at which light was 
emitted that reaches us with redshift z is given by 

1 [0+2 dx 
t(z) = — | 2 
Ho Jo XV Qa + KXT? + Mx? + Qrx-4 


(1.5.42) 


3W. de Sitter, Proc. Roy. Acad. Sci. (Amsterdam), 19, 1217 (1917); ibid. 20, 229 (1917); ibid. 20, 
1309 (1917); Mon. Not. Roy. Astron. Soc., 78, 2 (1917). 

4The use of the symbol Q, instead of Qy for the ratio of the vacuum energy density to the critical 
energy density has become standard, because of a connection with the cosmological constant discussed 
in a historical note below. 
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In particular, by setting z = 0, we find the present age of the universe: 


1 f! d 
y= Í 2 (1.5.43) 
Ho Jo XV 2A + KX? + Qyx 3 + RRI 


In order to calculate luminosity or angular diameter distances, we also 
need to know the radial coordinate r(z) of a source that is observed now 
with redshift z. According to Eqs. (1.2.2) and (1.5.41), this is 


dt 
= S — 
aa if | 


-5 1 [ dx 
aoHo Jija+a x2/Qq + Qex-2 + Qyx 3+ RAT) 


where 
siny K=41 
Sp]= 4 y K=0 
sinh y K=-1. 
This can be written more conveniently by using Eq. (1.5.40) to express ap Ho 
in terms of Qx. We then have a single formula 


a(z) = —VR 
M 


x sinh ay? f i 
j 1/(1+z) XV QA + KXT? + Qy x73 + Qrx-4 i 
(1.5.44) 


which can be used for any curvature. (Eq. (1.5.43) has a smooth limit for 
Qg — 0, which gives the result for zero curvature. Also, for Qg < 0, 
the argument of the hyperbolic sine is imaginary, and we can use sinh ix = 
isinx.) Using Eq. (1.5.44) in Eq. (1.4.3) gives the luminosity distance of a 
source observed with redshift z as 

l+z 


dr(z) = aor(z)(1 +z) = nl 
Ha! 


x sinh ay? | wd 
K Jya» XV 2A + Kx + Qyx3 + QRxI 
(1.5.45) 
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For K = 0 we have Qx = 0 and Eq. (1.5.45) becomes 


l+z f! dx 
dL(Z) = aorı (1 + z) = H / ; 
0 Jata xV 2a + Rux? ++ RRX 
(1.5.46) 


As we will see in Section 2.1, QR is much less than Qy, and the integral 
(1.5.46) converges at its lower bound for z —> oo whether or not Qp vanishes, 
so it is a good approximation to take Qg = 0 here. 

It is of some interest to express the deceleration parameter go in terms 
of the Qs. The p/p ratio w for vacuum, matter, and radiation is —1, 0, and 
1/3, respectively, so Eq. (1.5.39) gives the present pressure as 


3H? l 
Po = —= | -2a + 52R).- (1.5.47) 
Eq. (1.5.30) then gives 


4nG(3po+ po) 1 
= = Qy — 224A +2QR). 1.5.48 
3H? 5 (Qu A R) ( ) 


qo 


One of the reasons for our interest in the values of QK, Qm, etc. is that 
they tell us whether the present expansion of the universe will ever stop. 
According to Eq. (1.5.38), the expansion can only stop if there is a real root 
of the cubic equation 


QAU + ku+mu =0, (1.5.49) 


where u = a(t)/a(to) is greater than one. (We are ignoring radiation here, 
since it will become even less important as the universe expands.) This 
expression has the value +1 for u = 1. If Qa < 0 then the left-hand side 
of Eq. (1.5.49) becomes negative for sufficiently large u, so it must take the 
value zero at some intermediate value of u, and the expansion will stop 
when this value of u is reached. Even for Q, > 0 it is still possible for the 
expansion to stop, provided Qx = 1 — Qa — Qy is sufficiently negative 
(which, among other things, requires that K = +1). 


Historical Note 1: If we express the total energy momentum tensor T,» as 
the sum of a possible vacuum term —pyg,, and a term T nf arising from 


matter (including radiation), then the Einstein equations take the form 


l 
Ruy — 58wR^ = -8r GTH + 82 Gpy8gw - (1.5.50) 
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Thus the effect of a vacuum energy is equivalent to modifying the Einstein 
field equations to read 


l 
Ruy — 58wR — Agu = -8r GTM , (1.5.51) 


where 
A=8xrGpry. (1.5.52) 


The quantity A is known as the cosmological constant. It was introduced 
into the field equation by Einstein in 1917 in order to satisfy a condition 
that at the time was generally regarded as essential, that the universe should 
be static. According to Eqs. (1.5.18) and (1.5.19), a static universe is only 
possible if 3p + p = 0 and K = 8x Gpa?/3. If the contents of the universe 
are limited to vacuum energy and non-relativistic matter, then pọ = py+pr, 
p= —py,and py = 0. It follows that py = 2py > 0, so K > 0, which by 
convention means K = +1, so that a takes the value ag = 1/./87Gpy = 
1/VA. This is known as the Einstein model. 

Einstein did not realize it, but his cosmology was unstable: If a is a little 
less than ag then pm is a little larger than 2oy, so Eq. (1.5.18) shows that 
a/a < 0, and a thus begins to decrease. Likewise, if a is a little greater 
than ag then it begins to increase. The models with K = +1 and A > 0 
in which a starts at the Einstein radius a = ag with py = 2py and then 
expands to infinity (or starts at a = 0 and approaches ag as t > oo with 
just enough matter so that om = 2py at the Einstein radius), are known as 
Eddington—Lemaitre models. There are also models with K = +1 and a 
little more matter, that start at a = 0, spend a long time near the Einstein 
radius, and then expand again to infinity, approaching a de Sitter model. 
These are known as Lemaitre models.’ 

Oddly, de Sitter also invented his cosmological model (with a « exp(H?)) 
in order to satisfy a supposed need for a static universe. He originally 
proposed a time-independent metric, given by 


dr? 


1- r/R — 7° d? -r° sin? 6 do? $ 


dt? = (1—1°/R?)dt? — 
(1.5.53) 


5A. Einstein, Sitz. Preuss. Akad. Wiss. 142 (1917). For an English translation, see The Principle of 
Relativity (Methuen, 1923; reprinted by Dover Publications, New York, 1952), p. 35. 

6A. sS. Eddington, Mon. Not. Roy. Astron. Soc. 90, 668 (1930); G. Lemaître, Ann. Soc. Sci. Brux. 
A47, 49 (1927); Mon. Not. Roy. Astron. Soc. 91, 483 (1931). The interpretation of the cosmolog- 
ical constant in terms of vacuum energy was stated by Lemaître in Proc. Nat. Acad. Sci. 20, 12L 
(1934). 

7G. Lemaitre, op. cit. 
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or equivalently, setting r = R sin x, 
dr? = cos” x dt? — R? (ax? + sin? xd? + sin? X sin? 0 dg’) » (1.5.54) 


with R = ,/3/A constant. De Sitter did not realize at first that this metric 
has Tjo 4 0, so that his coordinate system was not co-moving.® Only later 
was it noticed that, using co-moving spatial coordinates and cosmological 
standard time, de Sitter’s model is equivalent to a Robertson—Walker metric 
with K = 0 and a « exp(t/R). 

After the discovery of the expansion of the universe, cosmologists lost 
interest in a static universe, and Einstein came to regret his introduction of 
a cosmological constant, calling it his greatest mistake. But as we shall see 
in the next section, there are theoretical reasons to expect a non-vanishing 
vacuum energy, and there is observational evidence that in fact it does not 
vanish. Einstein’s mistake was not that he introduced the cosmological 
constant — it was that he thought it was a mistake. 


Historical Note 2: There is a cosmological model due to Bondi and Gold? 
and in a somewhat different version to Hoyle,!? known as the steady state 
theory. In this model nothing physical changes with time, so the Hubble 
constant really is constant, and hence a(t) x exp(#72), just as in the de Sitter 
model. To keep the curvature constant, it is necessary to take K = 0. In 
this model new matter must be continually created to keep p constant as the 
universe expands. Since the discovery of the cosmic microwave background 
(discussed in Chapter 2) the steady state theory in its original form has been 
pretty well abandoned. 


1.6 Distances at large redshift: Accelerated expansion 


We now return to our account of the measurement of distances as a function 
of redshift, considering now redshifts z > 0.1, which are large enough so 
that we can ignore the peculiar motions of the light sources, and also large 
enough so that we need to take into account the effects of cosmological 
expansion on distance determination. 

For many years, the chief “standard candles” used at large redshift were 
the brightest galaxies in rich clusters. It is now well established that the 


8A. S. Eddington, The Mathematical Theory of Relativity, 2nd ed. (Cambridge University Press, 
Cambridge, 1924), Section 70. It is interesting that Eddington interpreted Slipher’s observation that 
most spiral nebulae exhibit redshifts rather than blueshifts in terms of the de Sitter model, rather than 
Friedmann’s models. 

°H. Bondi and T. Gold, Mon. Not. Roy. Astron. Soc. 108, 252 (1948). 

10p Hoyle, Mon. Not. Roy. Astron. Soc. 108, 372 (1948), ibid. 109, 365 (1949). 
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absolute luminosity of these brightest galaxies evolves significantly over 
cosmological time scales. There are also severe selection effects: there is a 
tendency to pick out larger clusters with brightest galaxies of higher abs- 
olute luminosity at large distances. The evolution of brightest galaxies is of 
interest in itself, and continues to be the object of astronomical study,! but 
the use of these galaxies as distance indicators has been pretty well aban- 
doned. Similarly, although the Tully—Fisher relation discussed in Section 1.3 
has been applied to galaxies with redshifts of order unity, at these redshifts 
it is used to study galactic evolution, rather than to measure cosmological 
parameters.” 

Fortunately, the Type Ia supernovae discussed in Section 1.3 provide 
an excellent replacement as standard candles.* They are very bright; the 
peak blue absolute magnitude averages about —19.2, which compares well 
with the absolute magnitude —20.3 estimated for our own galaxy. Also, as 
described in Section 1.3, a Type Ia supernova typically occurs when a white 
dwarf member of a binary pair has accreted just enough mass to push it 
over the Chandrasekhar limit, so that the nature of the explosion does not 
depend much on when in the history of the universe this happens, or on the 
mass with which the white dwarf started or the nature of the companion 
star. But it might depend somewhat on the metallicity (the proportion of 
elements heavier than helium) of the white dwarf, which can depend on the 
epoch of the explosion. The absolute luminosity of Type Ia supernovae is 
observed to vary with environmental conditions, but fortunately in the use 
of supernovae as distance indicators the bulk of this variation is correctable 
empirically. 

Observations of Type Ia supernovae have been compared with theoretical 
predictions (equivalent to Eq. (1.5.45)) for luminosity distance as a function 
of redshift at about the same time by two groups: The Supernova Cosmology 
Project* and the High-z Supernova Search Team.> 


'See, e.g., D. Zaritsky et al., in Proceedings of the Sesto 2001 Conference on Tracing Cosmic Evolution 
with Galaxy Clusters [astro-ph/0108152]; S. Brough et al., in Proceedings of the Sesto 2001 Conference 
on Tracing Cosmic Evolution with Galaxy Clusters [astro-ph/0108 186]. 

2N. P. Vogt et al., Astrophys. J. 465, 115 (1996). For a review and more recent references, see A. 
Aragon-Salmanca, in Galaxy Evolution Across the Hubble Time — Proceedings of I.A. U. Symposium 235, 
eds. F. Combes and J. Palous [astro-ph/0610587]. 

3For reviews, see S. Perlmutter and B. P. Schmidt, in Supernovae & Gamma Ray Bursts, ed. K. Weiler 
(Springer, 2003) [astro-ph/0303428]; P. Ruiz-Lapuente, Astrophys. Space Sci. 290, 43 (2004) [astro- 
ph/0304108]; A. V. Filippenko, in Measuring and Modeling of the Universe (Carnegie Observatories 
Astrophysics Series, Vol 2., Cambridge University Press) [astro-ph/0307139]; Lect. Notes Phys. 645, 
191 (2004) [astro-ph/0309739]; N. Panagia, Nuovo Cimento B 210, 667 (2005) [astro-ph/0502247]. 

48, Perlmutter et al., Astrophys. J. 517, 565 (1999) [astro-ph/9812133]. Also see S. Perlmutter et al., 
Nature 391, 51 (1998) [astro-ph/9712212]. 

5A. G. Riess et al., Astron. J. 116, 1009 (1998) [astro-ph/9805201]. Also see B. Schmidt et al., 
Astrophys. J. 507, 46 (1998) [astro-ph/9805200]. 
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The Supernova Cosmology Project analyzed the relation between 
apparent luminosity and redshift for 42 Type Ia supernovae, with redshifts 
z ranging from 0.18 to 0.83, together with a set of closer supernovae from 
another supernova survey, at redshifts below 0.1. Their original results are 
shown in Figure 1.1. 

With a confidence level of 99%, the data rule out the case Qa = 0 (or 
Qa < 0). Fora flat cosmology with Qg = Qr = 0, so that Q, + Qm = 1, 
the data indicate a value 


Qu = 0.28+0:03 (lo statistical) + 9-05 (identified systematics) 


(These results are independent of the Hubble constant or the absolute 
calibration of the relation between supernova absolute luminosity and time 
scale, though they do depend on the shape of this relation.) This gives the 
age (1.5.43) as 


70 km s~! Mpc7! 


to = 13.4773 x 10° r. 
0 —1.0 Ho y 
QOM,QA) = 
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(0.5,0.5) (0, 0) 
(1, 0) (1,0) 
24 (1.5,-0.5) (2, 0) 
Supernova $ = S 
Cosmology im a 
22 Proj 


ga Calan/Tololo 


(Hamuy et al. 


Effective mg 


18 


16 


0.02 0.05 0.1 0.2 0.5 1.0 
Redshift z 


14 


Figure 1.1: Evidence for dark energy, found in 1998 by the Supernova Cosmology Project, 
from S. Perlmutter et al., Astrophys. J. 517, 565 (1999) [astro-ph/9812133]. Here the effective 
blue apparent magnitude (corrected for variations in absolute magnitude, as indicated by 
supernova light curves) are plotted versus redshift for 42 high redshift Type Ia supernovae 
observed by the Supernova Cosmology Project, along with 18 lower redshift Type Ia super- 
novae from the Calán-Tololo Supernovae Survey. Horizontal bars indicate the uncertainty 
in cosmological redshift due to an assumed peculiar velocity uncertainty of 300 km sec™!. 
Dashed and solid curves give the theoretical effective apparent luminosities for cosmological 
models with Qx = 0 or Qa = 0, respectively, and various possible values of Q q. 
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For Qy = 0.28 and Q, = 1 — Qm, Eq. (1.5.48) gives a negative decelera- 
tion parameter, qo = —0.58, indicating that the expansion of the universe is 
accelerating. 

The High-z Supernova Search Team originally studied 16 Type Ia super- 
novae of high redshift (with redshifts ranging from 0.16 to 0.97), including 
2 from the Supernova Cosmology Project, together with 34 nearby super- 
novae, and conclude that Qa > 0 at the 99.7% confidence level, with no 
assumptions about spatial curvature. Their original results are shown in 
Figure 1.2. 

Their best fit for a flat cosmology is Qm = 0.28+0.10 and Q, = 1-Qy, 
giving an age of about (14.2+1.5) x 10° years, including uncertainties in the 
Cepheid distance scale. Assuming only Qm > 0, and with a conservative 
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Figure 1.2: Evidence for dark energy, found in 1998 by the High-z Supernova Search Team, 
from A. G. Riess et al., Astron. J. 116, 1009 (1998) [astro-ph/9805201]. In the upper panel 
distance modulus is plotted against redshift for a sample of Type Ia supernovae. The curves 
give the theoretical results for two cosmologies with Q, = 0 and a good-fit flat cosmology 
with Q qy = 0.24 and Qa = 0.76. The bottom panel shows the difference between data and 
a formerly popular Einstein-de Sitter model with Qyy = 0.2 and Q, = 0, represented by 
the horizontal dotted line. 
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fitting method, with 99.5% confidence they conclude that go < 0, again 
strongly indicating an accelerated expansion. Including 8 new supernovae 
in a sample of 230 supernovae of Type Ia gave® 1.4 Qm— 9a = —0.35+0.14, 
providing further evidence that Q, > 0. The case for vacuum energy was 
then strenghtened when the Supernova Cosmology Project,’ including a 
new set of supernova, found for a flat universe that Q4 = 0.754096 (stat.) + 
0.032(syst.). 

Both groups agree that their results are chiefly sensitive to a linear combi- 
nation of Q, and Qy, given as 0.82m — 0.624 by the Supernova Cosmol- 
ogy Project and Qy — Qa or 1.42m — Q1 by the High-z Supernova Search 
Team. The minus sign in these linear combinations, as in Eq. (1.5.48), 
reflects the fact that matter and vacuum energy have opposite effects on 
the cosmological acceleration: Matter causes it to slow down, while a pos- 
itive vacuum energy causes it to accelerate. The negative values found for 
these linear combinations shows the presence of a component of energy 
something like vacuum energy, with p ~ —p. This is often called dark 
energy. 

Incidentally, these linear combinations of Q, and Q y are quite different 
from the expression Qyy/2 — Qa, which according to Eq. (1.5.48) gives the 
deceleration parameter go that was the target of much cosmological work 
of the past. Thus the observations of Type Ia supernovae at cosmological 
distances should not be regarded as simply measurements of qo. 

The High-z Supernova Search Team subsequently began to use the same 
survey observations to follow the time development of supernovae that were 
used to find them. They discovered 23 new high redshift supernovae of 
Type Ia, including 15 with z > 0.7. Using these new supernovae along 
with the 230 used earlier by Tonry et al., and with the assumption that 
Qu + Qa = 1, they found the best-fit values Qy = 0.33 and Qa = 0.67. 

The crucial feature of the supernova data that indicates that Q, > Qm 
is that the apparent luminosity of Type Ia supernovae falls off more rapidly 
with redshift than would be expected in an Einstein-de Sitter cosmology 
with Qy = l and Qa, = 0. We can see the effect of vacuum energy 
on apparent luminosity by comparing the luminosity distance calculated 
in two extreme cases, both with no matter or radiation. For a vacuum- 
dominated flat model with Q, = 1 and Qn = Qy = Qr = 0, Eq. (1.5.46) 
gives 


z+2? 


di(z) = (vacuum dominated) , (1.6.1) 


6J. L. Tonry et al., Astrophys. J. 594, 1 (2003) [astro-ph/0305008}. 
TR. Knop et al., Astrophys. J. 598, 102 (2003) [astro-ph/0309368]. 
8B. J. Barris et al., Astrophys. J. 502, 571 (2004) [astro-ph/0310843}]. 
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while for an empty model with Qg = l and Qa = Qy = QR = 0, 
Eq, (1.5.46) gives 


di(z) = (empty) , (1.6.2) 


Ho 
Evidently for all z, vacuum energy increases the luminosity distance. The 
same increase is seen if we compare the more realistic case Q, = 0.7, QM = 
0.3, Qg = Qpr = 0 with the corresponding case without vacuum energy 
and Qn = 0.7, Qm = 0.3, Qa = Qr = 0, as can be seen in Figure 1.3. 

Both the Supernova Cosmology Project and the High-z Supernova Search 
Team found that curve of measured luminosity distances vs. redshift of Type 
Ia supernovae was closer to the upper than the lower curve in Figure 1.3. 
Indeed, according to Eq. (1.4.9), the negative value of go found by all groups 
corresponds to the fact that the apparent luminosity of the type Ia super- 
novae seen at moderate redshifts is /ess than in the empty model, for which 
qo = 0, in contrast with what had been expected, that the expansion is dom- 
inated by matter, in which case we would have had go > 0, and the apparent 
luminosities at moderate redshifts would have been larger than for go = 0. 

The connection between an accelerating expansion and a reduced 
apparent luminosity can be understood on the basis of the naive Newtonian 
cosmological model discussed in Section 1.5. In this model, the redshift we 
observe from a distant galaxy depends on the speed the galaxy had when 
the light we observe was emitted, but the apparent luminosity is inversely 
proportional to the square of the distance of this galaxy now, because the 
galaxy’s light is now spread over an area equal to 4z times this squared dis- 
tance. If the galaxies we observe have been traveling at constant speed since 
the beginning, as in the empty model, then the distance of any galaxy from 


dt Ho 
1.5 


z 
0.2 0.4 0.6 0.8 1 


Figure 1.3: Luminosity distance versus redshift for two cosmological models. The upper 
solid curve is for the case Q4 = 0.7, Qy = 0.3, Qg = QR = 0; the lower dashed curve 
is for an empty model, with Qx = 1, QA = Qy = QR = 0. The vertical axis gives the 
luminosity distance times the Hubble constant. 
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us now would be proportional to its speed when the light was emitted. In 
the absence of a vacuum energy, we would expect the galaxies to be slowing 
down under the influence of their mutual gravitational attraction, so that 
the speed we observe would be greater than the speed they have had since 
the light was emitted, and their distances now would therefore be smaller 
than they would be if the speeds were constant. Thus in the absence of 
vacuum energy we would expect an enhanced apparent luminosity of the 
supernovae in these galaxies. In fact, it seems that the luminosity distances 
of supernovae are larger than they would be if the speeds of their host 
galaxies were constant, indicating that these galaxies have not been slowing 
down, but speeding up. This is just the effect that would be expected from 
a positive vacuum energy. 

Of course, it is also possible that the reduction in apparent luminosity 
is due to absorption or scattering of light by intervening material rather 
than an accelerated expansion. It is possible to distinguish such effects 
from a true increase in luminosity distance by the change in the apparent 
color produced by such absorption or scattering, but this is a complicated 
business.’ This concern has been allayed by careful color measurements. !° 
But it is still possible to invent intergalactic media (so-called gray dust) that 
would reduce the apparent luminosity while leaving the color unchanged. 

This concern has been largely put to rest, first by the study!! of the super- 
novae SN1997ff found in the Hubble Deep Field! in a galaxy with a redshift 
z = 1.7+0.1, the greatest yet found for any supernova, and then by the dis- 
covery and analysis by a new team, the Higher-z Supernova Team,!? of 16 
new Type Ia supernovae, of which six have z > 1.25. These redshifts are so 
large that during a good part of the time that the light from these supernovae 
has been on its way to us, the energy density of the universe would have been 
dominated by matter rather than by a cosmological constant, and so the 
expansion of the universe would have been decelerating rather than acceler- 
ating as at present. Thus if the interpretation of the results of the two groups 
at smaller redshifts in terms of Qy and Q4 is correct, then the apparent 
luminosity of these supernovae should be /arger than would be given by a 
linear relation between luminosity distance and redshift, a result that could 
not be produced by absorption or scattering of light. We see this in Figure 
1.4, which shows the difference between the luminosity distance (in units 
Ao) for the realistic case with Q, = 0.7, Qm = 0.3, Qk = Qr = 0 and for 


See e.g., A. Aguirre, Astrophys. J. 525, 583 (1999) [astro-ph/9904319]. 

10R, Knop et al, ref. 7; also see M. Sullivan et al, Mon. Not. Roy. Astron. Soc. 340, 1057 (2003) 
[astro-ph/0211444]. 

11A, G. Riess et al., Astrophys. J. 560, 49 (2001) [astro-ph/0104455]. 

12R, L. Gilliland, P. E. Nugent, and M. M. Phillips, Astrophys. J. 521, 30 (1999). 

13A, G. Riess et al., Astrophys. J. 607, 665 (2004) [astro-ph/0402512]. 
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Figure 1.4: The luminosity distance times Ho for the realistic case Q4 = 0.7, Qyy = 0.3, 
Qg = QR = 0, minus its value for the empty case Qg = 1, QA = Qy = QR = 0, plotted 
against redshift. 


the empty model with Qg = 1, Qm = Qe = Qr = 0. We see that luminos- 
ity distances for the realistic model are greater than for the empty model for 
moderate redshift, but become less than for the empty model for z > 1.25. 
This is just what is seen. The apparent luminosity of all supernovae is consis- 
tent with the parameters Qm ~ 0.3, Qa ~ 0.7 found in the 1998 studies, but 
not consistent with what would be expected for gray dust and Q, = 0. These 
conclusions have subsequently been strengthened by the measurement of 
luminosity distances of additional Type Ia supernovae with redshifts near 
0.5.14 In 2006 Riess et al.!5 announced the discovery with the Hubble Space 
Telescope of 21 new Type Ia supernovae, which included 13 supernovae with 
redshifts z > 1 measured spectroscopically (not just photometrically). Their 
measured luminosity distances and redshifts, together with data on previ- 
ously discovered Type Ia supernovae, gave further evidence of a transition 
from a matter-dominated to a vacuum energy-dominated expansion, and 
showed that the pressure/density ratio of the vacuum energy for z > 1 is 
consistent with w = —1, and not rapidly evolving. 

Another serious concern arises from the possibility that the absolute 
luminosity of Type Ia supernovae may depend on when the supernovae 
occur. Because Type Ia supernovae occur at a characteristic moment in the 
history of a star, evolution effects on the luminosities of these supernovae 
are not expected to be as important as for whole galaxies, which at great 
distances are seen at an earlier stage in their history.!© Even so, the absolute 
luminosity of a Type Ia supernova is affected by the chemical composition 


14A., Clocchiatti et al., Astrophys. J. 642, 1 (2006) [astro-ph/0510155]. 

ISA. Riess et al, Astrophys. J. 659, 98 (2007) [astro-ph/061 1572]. 

16D, Branch, S. Perlmutter, E. Baron, and P. Nugent, contribution to the Supernova Acceleration 
Probe Yellow Book (Snowmass, 2001) [astro-ph/0109070]. 
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of the two progenitor stars of the supernova, which is in turn affected by the 
evolution of the host galaxy.'? Such effects are mitigated by taking account 
of the correlation of supernova absolute luminosity with decay time and 
with intrinsic color, both of which presumably depend on the progenitor’s 
chemical composition. Also, evidence for dark energy has been found in 
studies of subsets of Type Ia supernovae found in very different environ- 
ments with very different histories! The study of the seven supernovae 
with z > 1.25 mentioned above rules out models with Q4 = 0 and any sort 
of dramatic monotonic evolution of supernovae absolute luminosities that 
would mimic the effects of dark energy. 

There are other effects that might possibly impact the observed relation 
between supernova apparent luminosities and redshifts: 


1. The effect of weak gravitational lensing on the implications of the super- 
nova observations is expected to be small,!? except perhaps for small 
area surveys.” (Gravitational lensing is discussed in Chapter 9.) It had 
been thought that the apparent luminosity of the most distant observed 
supernova, SN1997ff, may be enhanced by gravitational lensing,”! con- 
ceivably reopening the possibility that the reduction of the apparent 
luminosity of the nearer supernovae is due to gray dust. However, a 
subsequent analysis by the same group” reported that the magnifica- 
tion of this supernova due to gravitational lensing is less than had been 
thought, and that the effect of the corrections due to gravitational lens- 
ing on current cosmological studies is small. Members of the High-z 
Supernova project have reported that instead this effect is likely to imp- 
rove agreement with the estimate that Qyy = 0.35 and Q, = 0.65.73 


2. It has been argued that inhomogeneities in the cosmic distribution of 
matter could produce an accelerating expansion, without the need 
for any sort of exotic vacuum energy.” Given the high degree of 


'7P Podsiadlowski et al., astro-ph/0608324. Evolution may also affect the extinction of light by dust 
in the host galaxy; see T. Totani and C. Kobayashi, Astrophys. J. 526, 65 (1999). 

18M. Sullivan et al., ref. 10. 

19A, J. Barber, Astron. Soc. Pacific Conf. Ser. 237, 363 (2001) [astro-ph/0109043]. 

20. Cooray, D. Huterer, and D. E. Holz, Phys. Rev. Lett. 96, 021301 (2006). 

215 Mortstell, C. unnarsson, and A. Goobar, Astrophys. J. 561, 106 (2001); C. Gunnarsson, in 
Proceedings of a Conference on New Trends in Theoretical and Observational Cosmology — Tokyo, 2001 
[astro-ph/0112340]. 

22], Jönsson et al., Astrophys. J. 639, 991 (2006) [astro-ph/0506765]. 

23N. Benitez et al., Astrophys. J. 577, L1 (2002) [astro-ph/0207097]. 

24E, W. Kolb, S. Matarrese, A. Notari, and A. Riotto, Astrophys. J. 626, 195 (2005) [hep-th/0503117]; 
E. W. Kolb, S. Matarrese, and A. Riotto, New J. Phys. 8, 322 (2006) [astro-ph/0506534]; E. Barausse, S. 
Matarrese, and A. Riotto, Phys. Rev. D 71, 063537 (2005). 
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homogeneity of the universe when averaged over sufficiently large 
scales, this seems unlikely.?° 


3. There is some evidence for two classes of type Ia supernovae,”° with 
the minority associated perhaps with merging white dwarfs, or with 
a variation in explosion physics. The effect on cosmological studies 
remains to be evaluated. 


4. Other uncertainties that can degrade the accuracy of measurements 
of dark energy (without casting doubt on its existence) arise from the 
circumstance that the shape of the curve of luminosity distance versus 
redshift is found by numerous observatories, both ground-based and 
space-based, and there are various flux calibration errors that can arise 
between these different observatories. 


5. The measurement of luminosity distance of any source of light at large 
redshift has historically been plagued by the fact that measurements 
are not “bolometric,” that is, equally sensitive to all wavelengths, but 
are rather chiefly sensitive to wavelengths in a limited range. The 
cosmological redshift changes the apparent colors of sources, and 
thereby changes the sensitivity with which apparent luminosity is mea- 
sured. To take this into account, the observed apparent magnitude is 
corrected with a so-called K-correction.?’ The K-correction for super- 
novae were worked out before the discovery of dark energy,” and has 
been refined subsequently.”? As the precision of supernovae observations 
improves, further improvements may also be needed in the K-correction. 


These observations of an accelerated expansion are consistent with the 
existence of a constant vacuum energy, but do not prove that this energy 
density really is constant. According to Eq. (1.5.18), the existence of an 
accelerated expansion does however require that a large part of the energy 
density of the universe is in a form that has p + 3p < 0, unlike ordinary 
matter or radiation. This has come to be called dark energy.*® 


25 É, Flanagan, Phys. Rev. D71, 103521 (2005) [hep-th/0503202]; G. G. Geshnizjani, D. J. H. Chung, 
and N. Afshordi, Phys. Rev. D 72, 023517 (2005) [astro-ph/0503553]; C. M. Hirata and U. Seljak, Phys. 
Rev. D 72, 083501 (2005) [astro-ph/0503582]; A. Ishibashi and R. M. Wald, Class. Quant. Grav. 23, 
235 (2006) [gr-qc/0509108]. 

26D, Howell et al., Nature 443, 308 (2006); S. Jha, A. Riess, & R. P. Kirshner, Astrophys. J. 654, 122 
(2007); R. Quimby, P. Höflich, and J. C. Wheeler, 0705.4467. 

27For a discussion of the K-correction applied to observations of whole galaxies, and original refer- 
ences, see G&C, p. 443. 

28A. Kim, A. Goobar, and S. Perlmutter, Proc. Astron. Soc. Pacific 108, 190 (1995) [astro- 
ph/9505024]. 

2p Nugent, A. Kim, and S. Perlmutter, Proc. Astron. Soc. Pacific 114, 803 (2002) [astro-ph/0205351]. 

30For a general review, see P. J. E. Peebles and B. Ratra, Rev. Mod. Phys. 75, 559 (2003). 
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To take into account the possibility that the dark energy density is not 
constant, it has become conventional to analyze observations in terms of 
its pressure/density ratio pp.£./ep.E. = w. Except in the case of a con- 
stant vacuum energy density, for which w = —1, there is no special reason 
why w should be time-independent. (A different, more physical, possibil- 
ity is explored at the end of Section 1.12.) Still, it is popular to explore 
cosmological models with w constant but not necessarily equal to —1. As 
long as the dark energy density and Qx are non-negative, the expansion of 
the universe will continue, with å always positive. As shown in Eq. (1.1.34), 
the dark energy density in this case goes as a~>~>", so if w is negative (as 
indicated by the supernova observations) the energy density of radiation and 
matter must eventually become negligible compared with the dark energy 
density. For w < —1/3, the effect of a possible curvature in the Friedmann 
equation (1.5.19) also eventually becomes negligible. The solution of this 
equation for w > —1 with å > 0 then becomes ¢ — tı —> CaS+3")/2, with 
C > 0, and ¢ an integration constant. This is a continued expansion, witha 
decreasing expansion rate. But for w < —1, sometimes known as the case of 
phantom energy, the solution with à > Ois instead t;-—t > Ca@+3")/2, again 
with C > 0. This solution has the remarkable feature that a(t) becomes 
infinite at the time ¢,;. In contrast with the case w > —1, for w < —1 all 
structures — galaxy clusters, galaxy clusters, stars, atoms, atomic nuclei, 
protons and neutrons — eventually would be ripped apart by the repulsive 
forces associated with dark energy.*! 

To further study the time dependence of the dark energy, a five year 
supernova survey, the Supernova Legacy Survey, was begun in 2003 at the 
Canada—France—Hawaii telescope on Mauna Kea. At the end of the first 
year, 71 high redshift Type Ia supernovae had been discovered and studied, 
with the result that Qy = 0.263 40.042(stat) + 0.032(sys). Combining this 
supernova data with data from the Sloan Digital Sky Survey (discussed 
in Chapter 8), and assuming that the dark energy has w = p/p time- 
independent, it is found that if w is constant then w = — 1.023 +0.09(stat) + 
0.054(sys), consistent with the value w = —1 for a constant vacuum energy. 
At the time of writing, results have just become available for 60 Type Ia 
supernovae from another supernova survey, ESSENCE.*? (The acronym is 
for Equation of State: Supernovae trace Cosmic Expansion). Combining 
these with the results of the Supernova Legacy Survey, the ESSENCE group 
found that if w is constant then w = —1.07 + 0.09(stat, lo) + 0.13(syst), 
and Qy = 0.267 (stat, lo). 


3IR. R. Caldwell, M. Kamionkowski, and N. N. Weinberg, Phys. Rev. Lett. 91, 071301 (2003) 
[astro-ph/0302506]. 

32P Astier, et al., Astron. Astrophys. 447, 31 (2006) [astro-ph/0510447]. 

33M. Wood-Vesey et al., astro-ph/0701041 
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The conclusion that dark energy makes up a large fraction of the energy 
of the universe has been confirmed by observations of the cosmic microwave 
background, as discussed in Section 7.2. This conclusion has also received 
support from the use of a different sort of secondary distance indicator, the 
emission of X-rays from hot gas in galaxy clusters. In Section 1.9 we will see 
that the measurement of redshift, temperature, apparent X-ray luminosity 
and angular diameter of a cluster allows a determination of the ratio of hot 
gas (“baryons”) to all matter in the cluster, with this ratio proportional to 
a a where d4 is the assumed angular-diameter distance of the cluster. 
This can be turned around: under the assumption that the ratio of hot gas 
to all matter is the same for all clusters in a sample, X-ray observations can 
be used to find the dependence of the cluster angular diameter distances 
on redshift.?4 In this way, observations by the Chandra satellite of X-rays 
from 26 galaxy clusters with redshifts in the range 0.07 < z < 0.9 have been 
used to determine that in a cosmology with a constant vacuum energy and 
cold dark matter, Q, = 0.941921, within 68% confidence limits.? Relax- 
ing the assumption that the cosmological dark energy density is constant, 
but assuming Qx = 0 and a constant w, and taking the baryon density 
to have the value indicated by cosmological nucleosynthesis (discussed in 
Section 3.2), this analysis of the Chandra data yields 1 — Qyy = 0.76 + 0.04 
and a dark energy pressure/density ratio w = — 1 2040:24, 

It is possible that measurements of luminosity distance can be pushed 
to much larger redshifts by the use of long gamma ray bursts as secondary 
distance indicators. These bursts definitely do not have uniform absolute 
luminosity, but there are indications that the absolute gamma ray luminosity 
is correlated with the peak gamma ray energy and a characteristic time 
scale.36 

The discovery of dark energy is of great importance, both in interpreting 
other observations and as a challenge to fundamental theory. It is pro- 
foundly puzzling why the dark energy density is so small. The contribution 
of quantum fluctuations in known fields up to 300 GeV, roughly the highest 
energy at which current theories have been verified, gives a vacuum energy 


345S, Sasaki, Publ. Astron. Soc. Japan 48, 119 (1996) [astro-ph/9611033]; U.-L. Pen, New Astron. 2, 
309 (1997) [astro-ph/9610147]. 

355, W. Allen, R. W. Schmidt, H. Ebeling, A. C. Fabian, and L. van Speybroeck, Mon. Not. Roy. 
Astron. Soc. 353, 457 (2004) [astro-ph/0405340]. For earlier applications of this technique, see K. Rines 
et al., Astrophys. J. 517, 70 (1999); S. Ettori and A. Fabian, Astron. Soc. Pac. Conf. Ser. 200, 369 
(2000); S. W. Allen, R. W. Schmidt, and A. C. Fabian, Mon. Not. Roy. Astron. Soc. 334, L11 (2002); 
S. Ettori, P, Tozzi, and P. Rosati, Astron. & Astrophys. 398, 879 (2003). The possibility of a variable 
ratio of hot gas to all matter is explored by R. Sadat et al., Astron. & Astrophys. 437, 310 (2005); L. D. 
Ferramacho and A. Blanchard, Astron. & Astrophys. 463, 423 (2007) [astro-ph/0609822]. 

36C, Firmani, V. Avila-Reese, G. Ghisellini, and G. Ghirlanda, Mon. Not. Roy. Astron. Soc. 372, 
28 (2006) [astro-ph/0605430]; G. Ghirlanda, G. Ghisellini, and C. Firmani, New J. Phys. 8, 123 (2006) 
[astro-ph/0610248]. 
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density of order (300 GeV),* or about 107’ g/cm?. This of course is vastly 
larger than the observed dark energy density, Qy po,crit ~ 1077 g/cm?, by 
a factor of order 10°°. There are other unknown contributions to the vac- 
uum energy that might cancel this contribution, coming from fluctuations 
in fields at higher energies or from the field equations themselves, but this 
cancelation would have to be precise to about 56 decimal places. There is 
no known reason for this remarkable cancelation.*” The discovery of dark 
energy now adds a second problem: why is the dark energy density compa- 
rable to the matter energy density at this particular moment in the history 
of the universe? 

In thinking about these problems, it is crucial to know whether the vac- 
uum energy is really time-independent, or varies with time, a question that 
may be answered by future studies of distant Type Ia supernovae or other 
measurements at large redshift. The possibility of a varying dark energy 
(known as quintessence) will be considered further in Section 1.12. 


1.7 Cosmic expansion or tired light? 


In comparing observations of redshifts and luminosity distances with theory, 
we rely on the general understanding of redshifts and luminosities outlined 
in Sections 1.2 and 1.4. One thing that might invalidate this understanding 
is absorption or scattering, which reduces the number of photons reach- 
ing us from distant sources. This possibility is usually taken into account 
by measuring the color of the source, which would be affected by absorp- 
tion or scattering, though as mentioned in the previous section there is a 
possibility of gray dust, which could not be detected in this way. Another 
possible way that apparent luminosities could be reduced is through the 
conversion of photons into particles called axions by intergalactic magnetic 
fields. There is also a more radical possibility. Ever since the discovery of 
the cosmological redshift, there has been a nagging doubt about its interpre- 
tation as evidence of an expanding universe. Is it possible that the universe 
is really static, and that photons simply suffer a loss of energy and hence 
a decrease in frequency as they travel to us, the loss of energy and hence 
the redshift naturally increasing with the distance that the photons have to 
travel? 

It is possible to rule out all these possibilities by comparing the 
luminosity distance dz (z) with the angular diameter distance d4(z) of the 
same distant source. None of the possibilities mentioned above can affect 
the angular diameter distance, while the conventional interpretation of 


37For a survey of efforts to answer this question, see S. Weinberg, Rev. Mod. Phys. 61, 1 (1989). 
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redshifts and luminosities provides the model-independent result (1.4.12), 
that dz (z)/d4(z) = (1 +z)’, so a verification of this ratio can confirm the 
conventional understanding of dz (z). 

We can check this formula for dz (z)/d4(z) by a “surface brightness test” 
suggested long ago by Tolman.! If a light source has an absolute luminos- 
ity per proper area £, then the apparent luminosity of a patch of area A 
will be £ = LA Jánd}. This patch will subtend a solid angle Q = A fae 
The surface brightness B is defined as the apparent luminosity per solid 
angle, so 
ae a 
2 4rd? l 
In the conventional big bang cosmology the ratio d4/dz is given by 
Eq. (1.4.12), so 


B 


(1.7.1) 


B=(1+z)"4 (=) (1.7.2) 


If we can find a class of light sources with a common value for the absolute 
luminosity per proper area £, then their surface brightness should be found 
to decrease with redshift precisely as (1 + z)~*. 

For instance, one important difference between “tired light” theories 
and the conventional big bang theory is that in the conventional theory all 
rates at the source are decreased by a factor (1 + z)~!, while in tired light 
theories there is no such slowing down. One rate that is slowed down at 
large redshifts in the conventional theory is the rate at which photons are 
emitted by the source. This is responsible for one of the two factors of 
(+z)! in formula (1.4.1) for apparent luminosity, the other factor being 
due to the reduction of energy of individual photons. On the other hand, if 
the rate of photon emission is not affected by the redshift, then in a static 
Euclidean universe in which photons lose energy as they travel to us, the 
apparent luminosity of a distant source L at a distance d will be given by 
L/4x(1 + z)d’, with only a single factor 1 + z in the denominator to take 
account of the photon energy loss. That is, the luminosity distance will be 
(1 + z)!/2d, while the angular diameter distance in a Euclidean universe is 
just d, so here dy /d4 = (1 + z)!/?, and the surface brightness of distant 
galaxies should decrease as (1 + z)~!. 

Lubin and Sandage? have used the Hubble Space Telescope to compare 
the surface brightness of galaxies in three distant clusters with redshifts 


IR. C. Tolman, Proc. Nat. Acad. Sci 16, 5111 (1930); Relativity, Thermodynamics, and Cosmology 
(Oxford Press, Oxford, 1934): 467. 

2L, M. Lubin and A. Sandage, Astron. J. 122, 1084 (2001) [astro-ph/0106566]. Their earlier work is 
described in A. Sandage and L. M. Lubin, Astron. J. 121, 2271 (2001); L. M. Lubin and A. Sandage, 
ibid, 2289 (2001) and Astron. J. 122, 1071 (2001) [astro-ph/0106563.] 


58 


1.8 Ages 


0.76, 0.90, and 0.92 with the surface brightness measured in closer galax- 
ies. They detect a decrease of B with increasing z that is consistent with 
Eq. (1.7.2) with reasonable corrections for the effects of galaxy evolution, 
and is quite inconsistent with the behavior B œ (1+z)~! expected in a static 
universe with “tired light.” 

In the standard big bang cosmology all rates observed from a distant 
source are slowed by a factor 1/(1 + z), not just the rate at which photons 
are emitted. This slowing has been confirmed? for the rate of decline of light 
from some of the Type Ia supernovae used by the Supernova Cosmology 
Project discussed in the previous section. The hypothesis that the absolute 
luminosity is simply correlated with the intrinsic decline time is found to 
work much better if the observed decline time is taken to be the intrinsic 
decline time stretched out by a factor 1 + z. Nothing like this would be 
expected in a static Euclidean universe with redshifts attributed to tired 
light. 


1.8 Ages 


As we have seen, a knowledge of the Hubble constant and of the matter and 
vacuum density parameters Q my and Q, allows us to estimate the age of the 
universe. In this section we will discuss independent estimates of the age of 
the universe, based on calculations of the ages of some of the oldest things 
it contains. 

Since metals (elements heavier than helium) found in the outer parts 
of stars arise chiefly from earlier generations of stars, the oldest stars are 
generally those whose spectra show small abundances of metals. These are 
the so-called Population II stars, found in the halo of our galaxy, and in 
particular in globular clusters. There are two main ways of estimating ages 
of old stars: 


A. Heavy element abundances 


Ifa nucleus decays with decay rate 4, and has an initial abundance Ainit, then 
the abundance A after a time T is A = Ainit exp(—AT). Hence if we knew 
Ainit and could measure A, we could determine T from T = A~! In(Ajnit /A). 
Unfortunately neither condition is likely to be satisfied. On the other hand, 
it is often possible to calculate the ratio of the initial abundances A init and 
Ad init Of two nuclei, and to measure their relative present abundance A, /A2. 


3B. Leibundgut et al., Astrophys. J. 466, L21 (1996); G. Goldhaber et al. (Supernova Cosmology 
Project), Astrophys. J. 558, 359 (2001) [astro-ph/0104382]. 
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This relative abundance is given by 


A\ (2) 
F= exp (Q2 - ANT), 
A2 Á? init 


1 A aes 
T= [in ( -)-m( at) | (1.8.1) 
à2— À] A Apinit 


If the initial abundances are similar and the observed abundances are very 
different, then the estimated value of T will be insensitive to the precise 
value of the initial abundance ratio. 

The initial relative abundances of heavy, radioactive elements are est- 
imated on the well-founded assumption that these elements are made in the 
so-called r-process, the rapid addition of neutrons to lighter elements such 
as iron in core-collapse supernova explosions, after which the neutron-rich 
isotopes formed in this way undergo multiple beta decays, transforming 
them to the most deeply bound nuclei with the same number of nucleons. 
This method has been used to put a lower bound on the age of our galaxy 
from the terrestrial abundance of 7°5U, which has a decay rate of 0.971 x 
10-°/yr. To avoid uncertainties in the distribution of 7°>U in earth, its 
abundance is measured relative to the isotope 7°8U, which has a slower decay 
rate of 0.154 x 10~?/yr, but behaves the same chemically and is presumably 
distributed in the same way. The initial abundance ratio of 5U to 738U is 
estimated to be 1.65 + 0.15; it is larger than one because three additional 
neutrons must be added to the progenitor of 7*°U to form the progenitor of 
?38U, On the other hand, the larger decay rate of 5U makes it (fortunately) 
less abundant than SU now. The present abundance ratio of uranium 
isotopes on earth is 0.00723, so this uranium has been decaying for a time 


In(1.65) — In(.00723) 
0.971 x 10-9/yr — 0.154 x 10-9/yr 


SO 


= 6.6 Gyr [1 Gyr = 10° yr]. 


But the sun is a second (or perhaps third) generation (called “Population I”) 
star, and presumably its uranium was being produced overalongtimeinterval 
before the formation of the solar system. The uranium abundance ratio has 
been supplemented with measurements of otherabundanceratios on theearth 
and meteorites, such as?*2Th/2?8U and !87Re/!8/Os ratios, and analyzed with 
the length of the period of heavy element formation left as a free parameter. 
This gives a more stringent (but less certain) lower bound of 9.6 Gyr! on the 
age of the heavy elements in the neighborhood of the solar system. 


1B, S. Meyer and D. N. Schramm, Astrophys. J. 311, 406 (1986). 
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A much more stringent lower bound on the age of the galaxy is given 
by applying these methods to heavy elements in metal-poor stars beyond 
the solar system. First thorium was observed spectroscopically in a very 
metal-poor star (and hence presumably old) K giant star, CS 22892-052.? 
The relative abundances in this star of the more stable elements produced in 
the r-process, as measured from the intensity of absorption lines in the star’s 
spectrum, matches those of the same elements in the solar system, except for 
a much lower abundance of the heaviest detected element thorium, which 
(for ??Th) has a half life 14 Gyr. Attributing the decrease in thorium to 
its radioactive decay, the age of the thorium in this star is estimated as 
14.1 +3 Gyr. Other estimates of the ages of CS 22892-052 and other metal- 
poor stars have been made using the measured abundance ratios of thorium 
to europium and lanthanum.? 

Uranium-238 decays more rapidly than ?3?Th, so we can get a more 
sensitive estimate of the age of a star by using both its uranium and its 
thorium abundances, providing of course that uranium as well as tho- 
rium lines can be observed in the star’s spectrum. No uranium absorp- 
tion lines were observed in the spectrum of CS 22892-052, but absorption 
lines from singly ionized uranium were subsequently observed in two other 
metal-poor star with an abundance of r-process elements, CS31082-001 and 
BD+17°3248. The observed abundance ratio of uranium to thorium in 
CS31082-001 is 10~9:74+9-!5. while the initial abundance ratio has been var- 
iously estimated as 10~°°> or 1070-10, Using these numbers in Eq. (1.8.1) 
gives this star an age of 12.5 + 3 Gyr.* Subsequent observations indicated 
ages of 14 + 2 Gyr, 15.5 +3.2 Gyr,° and 14.1 + 2.5 Gyr.’ Ina similar way, 
the age of BD+17°3248 has been calculated as 13.8 + 4 Gyr. (See Fig- 
ure 1.5.) More recently, both uranium and thorium lines have been found 
in the spectrum of the newly discovered metal-poor star HE 1523-0903; 
the ratio of thorium and uranium abundance to the abundances of other 
r-process elements, and to each other, was used to give an age of the star as 
13.2 Gyr.? 


2C, Sneden et al., Astrophys. J. 467, 819 (1996); Astrophys. J. 591, 936 (2003) [astro-ph/0303542]. A 
review with references to earlier work on thorium abundances was given by C. Sneden and J. J. Cowan, 
Astronomia y Astrofisica (Serie de Conferencia) 10, 221 (2001) [astro-ph/0008 185]. 

31. I Ivans et al., Astrophys. J. 645, 613 (2006) [astro-ph/0604180], and earlier references cited 
therein. 

4R. Cayrel et al., Nature 409, 691 (2001). 

SV, Hill et al., Astron. Astrophys. 387, 580 (2002). 

6H. Schatz et al., Astrophys. J. 579, 626 (2002). 

1S. Wanajo, Astrophys. J. 577, 853 (2002). 

8J. J. Cowan, et al., Astrophys. J. 572, 861 (2002) [astro-ph/0202429]. 

9A. Frebel et al., Astrophys. 660, L117 (2007). [astro-ph/0703414]. 
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Figure 1.5: Abundances of elements produced by the r-process in the star BD+17°3248, 
obtained by ground-based and Hubble Space Telescope spectroscopic observations. For 
comparison, the solid curve gives theoretical initial abundances, based on solar system data. 
Note the low observed abundances of thorium and uranium, compared with the theoretical 
initial abundances, which indicate an age for BD+17°3248 of 13.8 +4 Gyr. From J. J. Cowan 
et al., Astrophys. J. 572, 861 (2002) [astro-ph/0202429]. 


B. Main sequence turn-off 


The stars that satisfy the main sequence relation between absolute luminos- 
ity and surface temperature are still burning hydrogen at their core. When 
the hydrogen is exhausted at the core, hydrogen-burning continues in a shell 
around a (temporarily) inert helium core. The star then moves off the 
main sequence, toward higher luminosity and lower surface temperature. 
The heavier a star is, the more luminous it is on the main sequence, and 
the faster it evolves. Thus as time passes, the main sequence of a cluster 
of stars of different masses but the same age shows a turn-off that moves 
to lower and lower luminosities. (See Figure 1.6). Roughly, the absolute 
luminosity of stars at the turn-off point is inversely proportional to the age 
of the cluster. In particular, observations of the main sequences of a number 
of globular clusters gave ages variously calculated!? as 11.5 + 1.3 Gyr, 1241 
Gyr, 11.8 + 1.2 Gyr, 14.0 + 1.2 Gyr, 1241 Gyr, and 12.2 + 1.8 Gyr. A sum- 
mary by Schramm!! found that most of the discrepancies disappear when 


!0For references, see B. Chaboyer, Phys. Rep. 307, 23 (1998) [astro-ph/9808200]. 
1D, Schramm, in Critical Dialogues in Cosmology, N. Turok, ed. (World Scientific, Singapore, 
1997): 81 
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Figure 1.6: Color-magnitude diagram for the globular cluster M15. Visual apparent mag- 
nitudes of M15 stars are plotted on the vertical axis. Since all stars in M15 are at about 
the same distance from earth, the apparent visual magnitude differs from the absolute visual 
magnitude by a constant term, with absolute luminosities increasing upwards. The difference 
of apparent blue and visual magnitudes is plotted on the horizontal axis. This is a measure 
of surface temperature, with temperature decreasing to the right. If M15 were young, the 
main sequence would continue upwards and to the left; the position of the main sequence 
turn-off (MSTO) and other features of the diagram indicate that the age of the cluster is 
15 + 3 Gyr. Diagram from B. Chaboyer, Phys. Rep. 307, 23 (1998), based on data of P. R. 
Durrell and W. E. Harris, Astron. J. 105, 1420 (1993) [astro-ph/9808200]. 


the various calculations are done with the same input values for parameters 
like the initial abundance of helium, oxygen, and iron, and gave a consensus 
value as 14 + 2(statistical) + 2(systematic) Gyr. Note that all these ages 
are sensitive to the distance scale; a fractional change êd /d in estimates of 
distances would produce a fractional change 6L/L = —2êd /d in estimates 
of absolute luminosities, and hence a fractional change 6t/t ~ +26d/d 
in estimates of ages. Using measurements of distances to nine globular 
clusters with the Hipparcos satellite yields an estimated galactic age!* of 
13.2 + 2.0 Gyr. 


12Ẹ, Carretta, R. G. Gratton, G. Clementini, and F. F. Pecci, Astrophys. J. 533, 215 (2000) [astro- 
ph/9902086]. 
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These ages would pose a problem for what used to be the most popular 
model, with Qyy = 1 and no vacuum energy. In this case, the age of the 
universe is 


2 k M 
i 93 (= m/sec/ ee) O, 


= A 


which is somewhat younger than the oldest objects in the galaxy, though not 
by many standard deviations. Inclusion of a constant vacuum energy helps 
to avoid this problem; as remarked in Section 1.5, with nothing else in the 
universe we would have a(t) x exp(Ht), and the age of the universe would be 
infinite. As we saw in Section 1.6, the supernovae distance—redshift relation 
indicates that the vacuum energy is now roughly twice the matter energy, 
giving an age much longer than 2/3Hp: 


70 k M 
to = 13.4413 ( a 2e) Gyr, 


This removes the danger of a conflict, provided that the globular clusters in 
our galaxy are not much younger than the universe itself. In fact, there is 
now a truly impressive agreement between the age of the oldest stars and 
star clusters on one hand and the cosmic age calculated using values of Ho, 
m, and Q, found from the redshift—distance relation. As we will see in 
Section 7.2, there is also an excellent agreement between these ages and the 
age calculated using parameters measured in observations of anisotropies 
in the cosmic microwave background. 

So far in this section, we have considered only the present age of our 
own galaxy. It is also possible to estimate the ages of other galaxies at 
high redshift, at the time far in the past when the light we now observe 
left these galaxies. Of course, it is not possible to distinguish individual 
stars or globular clusters in these galaxies, but the spectrum of the galaxy 
gives a good idea of the age. We need the whole spectrum to separate the 
effects of metallicity, scattering, etc., but roughly speaking, the redder the 
galaxy, the more of its bright bluer stars have left the main sequence, and 
hence the older it is. In this way, it has been found! that the radio galaxies 
53W091(z = 1.55) and 53W069(z = 1.43) have ages ~ 3.5 Gyr and 3 to 
4 Gyr, respectively. This sets useful lower bounds on the vacuum energy. In 
a model with non-relativistic matter and a constant vacuum energy, the age 
of the universe at the time of emission of light that is seen at present with 


By 5, Dunlop et al., Nature 381, 581 (1996); J. S. Dunlop, in The Most Distant Radio Galaxies - 
KNAW Colloquium, Amsterdam, October 1997, eds. Best et al. [astro-ph/9801114]. 
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redshift z is given by Eq. (1.5.42) as 


1 

1 T d 

m= af” Z (18.2) 
Ho Jo x Qn + QKI? + Qmx? + QRx~4 


Any galaxy observed with redshift z must have been younger than this at 
the time that its light was emitted. For instance, for a flat universe with 
Qg = Qpr = 0, so that Qm = 1 — Qa, the existence of a galaxy at z = 1.55 
with age ~ 3.5 Gyr sets a lower bound! on Q, of about 0.6 for Hy = 70 
km s~! Mpc7!. 

Eventually the accuracy of these age determinations may become good 
enough to allow us to measure at least the dependence of redshift on the 
cosmic age. Of course, galaxies form at various times in the history of the 
universe, so the age of any one galaxy does not allow us to infer the age of 
the universe at the time light we now see left that galaxy. However, the 
homogeneity of the universe implies that the distribution of cosmic times of 
formation for any one variety of galaxy is the same anywhere in the universe. 
From differences in the distributions of ages of a suitable species of galaxy 
at different redshifts, we can then infer the difference of cosmic age t at these 
redshifts. The Robertson—Walker scale factor a(t) is related to the redshift 
z(t) observed now of objects that emitted light when the cosmic age was t 
by 1+ z(t) = a(to)/a(t), so Z = —H(t)(1 + z). To calculate Z, we note that 
for K = 0, H(t) = 87 Gp(t)/3, and 6 = —3H (p +p), 


HÀO = -4n G( p) +p) i (1.8.3) 


2 (5 3p 
= eLa 1.8.4 
eT G $ 2) CSN 


Thus measurements of differences in ¢ for various differences in redshift may 
allow a measurement of the ratio p/p at various times in the recent history 
of the universe. !> 


Then for K = 0 


1.9 Masses 


We saw in Section 1.6 that the observed dependence of luminosity distance 
on redshift suggests that the fraction Qa of the critical density provided by 


14L, M. Krauss, Astrophys. J. 489, 486 (1997); J. S. Alcaniz and J. A. S. Lima, Astrophys. J. 521, L87 
(1999) [astro-ph/9902298]. 
ISR. Jiminez and A. Loeb, Astrophys. J. 573, 37 (2002) [astro-ph/0106145]. 
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non-relativistic matter is roughly 30%. In this section we will consider other 
independent ways that Q yy is measured. 


A. Virialized clusters of galaxies 


The classic approach! to the measurement of Q y is to use the virial theo- 
rem to estimate the masses of various clusters of galaxies, calculate a mean 
ratio of mass to absolute luminosity, and then use observations of the total 
luminosity of the sky to estimate the total mass density, under the 
assumption that the mass-to-light ratio of clusters of galaxies is typical 
of the universe as a whole. 

To derive the virial theorem, consider a non-relativistic gravitationally 
bound system of point masses m, (either galaxies, or stars, or single parti- 
cles) with positions relative to the center of mass (in an ordinary Cartesian 
coordinate system) X,. The equations of motion are 


ey OV 
mäi = -5y (1.9.1) 
where the potential energy V is 
1 G M me 
V=--}?} 5. 1.9.2 
2 2 [Xp = Xel Á ) 


Multiplying Eq. (1.9.1) with X/ and summing over n and i gives 


> Xia T Lome ax -jp a —2T, (1.9.3) 


where T is the internal kinetic energy (not counting any motion of the center 
of mass) 


l : 
=5 XO mX}. (1.9.4) 
n 


Let us assume that the system has reached a state of equilibrium (“become 
virialized”), so that although the individual masses are moving there is no 
further statistical evolution, and in particular that 


a? A 
= Som X (1.9.5) 


Ip Zwicky, Astrophys. J. 86, 217 (1937); J. H. Oort, in La Structure et Evolution de l’ Universe 
(Institut International de Physique Solvay, R. Stoops, Brussels, 1958): 163. 
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(This is why it was important to specify that X, is measured relative to the 
center of mass; otherwise a motion of the whole cluster would give the sum 
a term proportional to 77, invalidating Eq. (1.9.5).) But V is of order —1 
in the coordinates, so the left-hand side of Eq. (1.9.3) is just V, giving the 
virial theorem: 


2T+V=0. (1.9.6) 
We may express T and V as 
1 2 1 2,1l 
T=-Mw’), V = — -GM ({-), (1.9.7) 
2 2 r 


where (v?) is the mean (mass weighted) square velocity relative to the center 
of mass, (1/r) is the mean inverse separation, and M = }_ „mnp is the total 
mass. Eq. (1.9.6) thus gives the virial formula for M: 


2 
M= 2 (v*) 
G (1/r) 


(1.9.8) 


This derivation does not apply to irregular clusters of galaxies, like the 
nearby one in Virgo. Clusters like this do not seem to have settled into a 
configuration in which the condition (1.9.5) is satisfied, and therefore prob- 
ably do not satisfy the virial theorem. On the other hand, the virial theorem 
probably does apply at least approximately to other clusters of galaxies, like 
the one in Coma, which appear more or less spherical. According to general 
ideas of statistical equilibrium, we may expect the rms velocity dispersion 
y (v?) of the dominant masses in such clusters to equal the velocity disp- 
ersion of the visible galaxies in the cluster, which can be measured from the 
spread of their Doppler shifts, and also to equal the velocity dispersion of 
the ionized intergalactic gas in the cluster, which since the advent of X-ray 
astronomy can be measured from the X-ray spectrum of the gas. The values 
obtained in these ways for (v?) are independent of the distance scale. On the 
other hand, values for (1/r) are obtained from angular separations: the true 
transverse proper distance d is given in terms of the angular separation 6 by 
d = 6dy, where d4 is the angular diameter distance (1.4.11). For clusters 
with z < 1, Eqs. (1.4.9) and (1.4.11) give d4 ~ z/Họ, sod ~ 0z/Ho . Thus 
the estimated values of (1/r) for galaxy clusters with z « 1 scale as Ho, 
and the values of M inferred from Eq. (1.9.8) scale as 1/Ho. The absolute 
luminosity L of a cluster of galaxies with redshift z and apparent luminosity 
£ is given for z < 1 by Eqs. (1.4.2) and (1.4.9) as L = 47z?£/H2, so the 
values of L scale as Hy 2 and the mass-to-light ratios obtained in this way 
therefore scale as Hy l /Ho 2 _— Ho. 
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Estimates of M /L for rich clusters have generally given results of order 
200 to 300 h Mo/Lo, where h is the Hubble constant in units of 100 km 
s7! Mpe~!, and Mo and Lo are the mass and absolute luminosity of the 
sun. For instance, a 1996 study? of 16 clusters of galaxies with redshifts 
between 0.17 and 0.55 gave M/L = (295 + 53)hMo/Lo. Some of the 
same group? have corrected this result for various biases, and now find 
M/L = (213+ 59)hMo/Lo. A more recent application* of the virial 
theorem to 459 clusters has found a value M/L ~ 348 h Mo/Lo. 

All these values of M /L for clusters of galaxies are very much larger than 
the mass-to-light ratios of the visible regions of individual galaxies.> The 
mass-to-light ratios of individual elliptical galaxies can be measured using 
the virial theorem, with v < v2 > taken as the velocity dispersion of stars 
contained in the galaxy; this gives mass-to-light ratios generally in the range 
of 10 to 20 h Mo/Lo. All of the visible light from clusters comes from their 
galaxies, so we must conclude that most of the mass in clusters of galaxies 
is in some non-luminous form, either in the outer non-luminous parts of 
galaxies or in intergalactic space. It has been argued that this mass is in large 
dark halos surrounding galaxies, extending to 200 kpc for bright galaxies.’ 
The nature of this dark matter is an outstanding problem of cosmology, to 
which we will frequently return. 

Incidentally, the large value of M/L given by the virial theorem for 
elliptical galaxies shows that most of the mass of these galaxies is not in 
the form of stars as bright as the sun. It is harder to estimate M/L for 
spiral galaxies, but since the work of Vera Rubin® it has been known that 
most of their mass is also not in luminous stars.’ If most of the mass of a 
spiral galaxy were in the luminous central regions of the galaxy, then the 
rotational speeds of stars outside this region would follow the Kepler law, 
v xr !/?, Instead, it is observed that v outside the central region is roughly 
constant, even beyond the visible disk of the galaxy, which is what would 
be expected for a spherical halo with a mass density that decreases only as 
1/r?, in which case most of the mass of the galaxy would be in the dark outer 


2R.G. Carlberg et al., Astrophys. J. 462, 32 (1996). 

3R. G. Carlberg, H. K. C. Yee, and E. Ellingson, Astrophys. J. 478, 462 (1997). 

1H, Andernach, M. Plionis, O. López-Cruz, E. Tago, and S. Basilakos, Astron. Soc. Pacific Conf. 
Ser. 329, 289 (2005) [astro-ph/0407098]. 

>This conclusion was first reached in a study of the Coma cluster by F. Zwicky, Helv. Phys. Acta 6, 
110 (1933). 

6T. R. Lauer, Astrophys. J. 292, 104 (1985); J. Binney and S. Tremaine, Galactic Dynamics (Princeton 
University Press, Princeton, 1987). 

TN. A. Bahcall, L. M. Lubin, and V. Dorman, Astrophys. J. 447, L81 (1995). 

8V, C. Rubin, W. K. Ford, and N. Thonnard, Astrophys. J. 225, L107; 238, 471 (1980). 

9M. Persic and P. Salucci, Astrophys. J. Supp. 99, 501 (1995); M. Persic, P. Salucci, and F. Stel, Mon. 
Not. Roy. Astron. Soc. 281, 27P (1996). 
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parts of the halo. There is some evidence from the absence of gravitational 
microlensing by the halo (discussed in Section 9.2) that this mass is not in 
the form of dark stars either, but it is still possible that most of the matter in 
galaxies is baryonic. We will not go into this in detail, because the formation 
of galaxies involves cooling processes that requires baryonic matter of the 
same sort as in stars, so that we would only expect the value of M/L for 
galaxies to be similar to the value for the universe as a whole if the matter 
of the universe were mostly baryonic. 

In using the value of M /L derived from the virial theorem for clusters of 
galaxies to find the mass density of the universe, we cannot just add up the 
luminosity per volume of clusters, because most of the light of the universe 
comes from “field” galaxies that are not in clusters. Instead, if we assume 
that the field galaxies are accompanied by the same amount of dark matter 
as the galaxies in clusters, as argued in ref. 7, then we can find Qy by 
using the value of M /L for clusters together with an estimate of the total 
luminosity density £ to estimate the total mass density as 


pu =(M/L)L. (1.9.9) 


Since values of absolute luminosities inferred from apparent luminosities 
and redshifts scale as Hy 2 and distances inferred from redshifts scale as 
Hy | the total luminosity density of the universe calculated by adding up the 
absolute luminosities of galaxies per volume scales as Ho 2} (Hy D3 = Ho. 
For example, a 1999 estimate!’ gave £ = 2 0.2 x 108 h Lo Mpc™?. For 
the purpose of calculating Qm it is more convenient to write this as a ratio 
of the critical mass density to the luminosity density: 


powrit/L = (1390 + 140) h Mo/Lo - 


(Here we use Mo = 1.989 x 1033 g, 1 Mpc = 3.0857 x 10°% cm, and 
Port = 1.878 x 107? A? g/cm3.) Taking M/L = (213 + 53)hMo/Lo 
gives then 
M/L 

Po,crit / £ 
with the first uncertainty arising from £ and the second from M/L. Itis 
important to note that this is independent of the Hubble constant, as both 
LM /L and po,crit scale as Hy. 

This estimate of Qm is somewhat lower than those derived from the 
redshiftuminosity relation of supernovae and from the anisotropies in the 


Qu = 0.15 0.02 + .04 , 


105S, Folkes et al., Mon. Not. Roy. Astron. Soc. 308, 459 (1999); M. L. Blanton et al., Astron. J. 121, 
2358 (2001). 
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cosmic microwave background, to be discussed in Section 2.6 and Chapter 7. 
But all these estimates agree that Q m is distinctly less than unity. 


B. X-ray luminosity of clusters of galaxies 


Does the dark matter in clusters of galaxies consist of ordinary nuclei and 
electrons? We can find the ratio of the fraction Qzg of the critical density 
provided by baryonic matter (nuclei and electrons) to the fraction Qm pro- 
vided by all forms of non-relativistic matter by studying the X-rays from 
clusters of galaxies, for it is only the collisions of ordinary baryonic parti- 
cles that produces these X-rays. Because these collision processes involve 
pairs of particles of baryonic matter, the absolute X-ray luminosity per unit 
proper volume takes the form 


fya A(Ts) pp. (1.9.10) 


where Tg and pz are the temperature and density of the baryonic matter, and 
A(T) is a known function of temperature and fundamental constants. The 
baryonic density satisfies the equation of hydrostatic equilibrium, which 
(assuming spherical symmetry) follows from the balance of pressure and 
gravitational forces acting on the baryons in a small area A and between 
radii r and r + ôr: 


Ad G f" 
A(ps + ôr) - ps()) = aa 4n par (r) dr , 


or, canceling factors of A and êr and using the ideal gas law pg = kg 
TspB/mB, 


47r? pu (r) dr, 


d (keTa)ppr)\ Ges) f" 
dr mB r2 0 
where pm (r) is the total mass density, kg is Boltzmann’s constant, mp is a 
characteristic mass of the baryonic gas particles, and r is here the proper 


distance to the center of the cluster. Multiplying by r?/pg(r) and differen- 
tiating with respect to r yields 


d| ° d (eee 
dr | pg(r) dr mB 


) =—4rGrpy(r). (1.9.11) 


If we make the assumption that cold dark matter particles, or whatever 
particles dominate the dark intergalactic matter, have an isotropic velocity 
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distribution (which is not very well motivated), then the same derivation 
applies to these particles, and their density pp = pm — pp satisfies the 
non-linear differential equation 


Al °? d (= To(r)pp(r) 


2 

alea T ) = —4r Gr'pm (r). (1.9.12) 
where Tp(r) and mp are the temperature and mass of the dark matter parti- 
cles. With perfect X-ray data and a knowledge of the distance of the source, 
one could measure the X-ray luminosity density Ly (r) and (using the X- 
ray spectrum) the baryon temperature Tg(r) at each point in the cluster, 
then use Eq. (1.9.10) to find the baryon density pg(r) at each point, and 
then use Eq. (1.9.11) to find the total mass density at each point. We could 
then calculate the fractional baryon density pg/oy, and if we were interested 
we could also use Eq. (1.9.12) to find the velocity dispersion kgTp(r)/mp 
of the dark matter. 

In practice, it is usually necessary to use some sort of cluster model. In 
the simplest sort of model, one assumes an isothermal sphere: the tempera- 
tures Tg and Tp are taken to be independent of position, at least near the 
center of the cluster where most of the X-rays come from. It is also often 
assumed that the same gravitational effects that causes the concentration of 
the hot intergalactic gas in the cluster is also responsible for the concentra- 
tion of the dark matter, so that the densities og(r) and pm (r) are the same, up 
to a constant factor, which represents the cosmic ratio Qg/Qjy of baryons 
to all non-relativistic matter. (These gravitational effects are believed to 
be a so-called “violent relaxation,”!! caused by close encounters of clumps 
of matter whose gravitational attraction cannot be represented as an inter- 
action with a smoothed average gravitational field. The condensation of 
galaxies out of this mixture requires quite different cooling processes that 
can affect only the baryonic gas, which is why galaxies have a lower pro- 
portion of dark matter and a lower mass-to-light ratio.) Comparison of 
Eqs. (1.9.11) and (1.9.12) shows that pg(r) and pp(r) will be proportional 
to each other, and hence also to pm (r) if the velocity dispersions of the dark 
matter and hot baryonic gas are the same: 


kgTu/mm = kgTp/mp = o°. (1.9.13) 
Equations (1.9.11) and (1.9.12) both then tell us that 


eu (r) = pm (0) F (r/ro) (1.9.14) 


11D, Lynden-Bell, Mon. Not. Roy. Astron. Soc. 136, 101 (1967). 
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where F'(0) = 1; ro is a core radius, defined conventionally by 


902 
Sr e 1.9.15 
"=y Goum cee 
and F (u) is a function satisfying the differential equation 
d { w dF(u) 5 
= —9u F (u) . 1.9.16 
du (75 du ey) ( ) 


We must also impose the boundary condition that pom is analytic in the 
coordinate X at X = 0, which for a function only of r means that it is 
given near r = 0 by a power series in r°, so that F(u) is given near u = 0 
by a power series in u?, F(u) = 1 + O(u?). Together with this boundary 
condition, Eq. (1.9.16) defines a unique function!” that for small u has the 
approximate behavior! 


F(u) = d+)?” (1.9.17) 


The solution to Eq. (1.9.16) is shown together with the approximation 
(1.9.17) in Figure 1.7. 

For large u, F (u) approaches the exact solution 2/9u?. Taken literally, 
this would make the integral for the total mass diverge at large r, which 
shows that the assumption of constant o? must break down at some large 
radius. Often the function F (u) is taken simply as!4 


Fu) = (14+)? , 
where £ is an exponent of order unity. 


F(u) 

1 
0.8 
0.6 
0.4 
0.2 


u + u 
05 1 15 2 25 3 4 5 67 8 9 10 


Figure 1.7: The solution to Eq. (1.9.16) (solid line) and the approximation (1.9.17) (dashed 
line). For the lower values of u in the figure at the left, the two curves are indistinguishable. 


12For a tabulation of values of F (u), see e. g. J. Binney and S. Tremaine, Galactic Dynamics (Princeton 
University, Princeton, 1987): Table 4.1. 

BI, R. King, Astron. J. 67, 471 (1962). 

14A. Cavaliere and R. Fusco-Fermiano, Astron. Astrophys. 49, 137 (1976). 
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Also, Eq. (1.9.11) has the solution 
pB(r) = pBO)F(r/ro) , (1.9.18) 


with the same function F(u) and the same core radius rp. We can measure 
the core radius from the X-ray image of the cluster, and measure o? from 
the X-ray spectrum, so that Eq. (1.9.15) can be used to find the central 
density om (0) of all non-relativistic matter. The central density og(0) of 
the baryonic matter can then be found from the total X-ray luminosity, 
which with these approximations (and using Eq. (1.9.10)) is 


Ha J dx Ly = 4r A(Tp) R pp OL, (1.9.19) 
where 
T= Í uF’ (w) du . (1.9.20) 
0 


Even though the solution of Eq. (1.9.16) gives an infinite mass, it gives a 
finite total X-ray luminosity, with Z = 0.1961. (The approximation (1.9.17) 
would give T = 2/16 = 0.1963.) 

For a cluster at redshift z, the core radius ro inferred from observation of 
the angular size of the cluster will be proportional to the angular diameter 
distance d4(z), while the temperature and velocity dispersion found from 
the X-ray spectrum will not depend on the assumed distance. Thus the 
value of the central total matter density pom (0) given by Eq. (1.9.15) will be 
proportional to 1 /d3 (z). On the other hand, the absolute X-ray luminos- 
ity Ly inferred from the apparent X-ray luminosity will (like all absolute 
luminosities) be proportional to the value assumed for d? (z), the square of 
the luminosity distance, so with rọ « d4, the central baryon density og(0) 
given by Eq. (1.9.19) will be proportional to [d? (z) /d3 (z)]'/?. The value of 
the ratio of central densities inferred from observations of a given cluster at 
redshift z will therefore have a dependence on the distance assumed for the 
cluster given by 


pB(0) 
pm (0) 


in which we have used the relation (1.4.12) between luminosity and angular 
diameter distances. 

Forz « 1, we have d4(z) ~ dz(z) =~ z/Họo, and so according to 
Eq. (1.9.21) the value of og(0)/pm (0) obtained from observations of clus- 
ters of small redshift will be proportional to the assumed value of Hy we 
It is believed that most of the baryonic mass in a cluster of galaxies is in the 


x dr (Dd!) = A + 2)°d3@), (1.9.21) 
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hot gas outside the galaxies, and if we suppose that this mass is the same 
fraction of the total mass as in the universe as a whole,!> then we should get 
the same value of og(0)/pm (0), equal to Qg/Q4y, for all clusters, whatever 
value we assume for Ho, but this value of Qg/Q yy will be proportional to 
the assumed value of Hy 3/2, For example, Schindler!® quotes various stud- 
ies that give og(0)/om (0) as 0.14, 0.11, 0.12, and 0.12 for Ho = 65 km s7! 
Mpc™!, so if we take the average 0.12 of these values as the cosmic value of 
Qp/Qy for Ho = 65 km s7! Mpc™!, then for a general Hubble constant 
we find 


g/m x 0.06 h? , (1.9.22) 


where h as usual is Hubble’s constant in units of 100 km s~! Mpe~!. We 
can thus conclude pretty definitely that only a small fraction of the mass in 
clusters of galaxies is in a baryonic form that can emit X-rays. 

On the other hand, when we study clusters with a range of redshifts that 
are not all small, we will not get a uniform value of og(0)/pm (0) unless 
we use values of d4(z) with the correct dependence on z. As remarked in 
Section 1.6, observations of clusters have been used in this way to learn 
about the z-dependence of d4(z). 

It should be mentioned that computer simulations that treat galaxy clus- 
ters as assemblages of collisionless particles do not show evidence for a 
central core,!7 but instead indicate that the dark matter density at small 
distances r from the center should diverge as r7! to r~3/?. On the other 
hand, it has been shown! that the density of a baryonic gas in hydrostatic 
equilibrium in the gravitational field of such a distribution of dark mat- 
ter does exhibit the core expected from Eq. (1.9.18). In any case, the dark 
matter and baryonic gas densities do have the same distributions at 
distances from the center that are larger than ro. 

As we will see in Section 3.2, it is possible to infer a value for Q Bh? from 
the abundances of deuterium and other light isotopes, which together with 
Eq. (1.9.22) can be used to derive a value for Qyh'/?. There are several 
other methods for estimating Qu or Q mh? that will be discussed elsewhere 


IS This is argued by S. D. M. White, J. F. Navarro, A. E. Evrard, and C. S. Frenk, Nature 366, 429 
(1993). Calculations supporting this assumption are described in Section 8.3. 

16S, Schindler, in Space Science Reviews 100, 299 (2002), ed. P. Jetzer, K. Pretzl, and R. von Steiger 
(Kluwer) [astro-ph/0107028]. 

17], F, Navarro, C. S. Frenk, and S. D. M. White, Astrophys. J. 462, 563 (1996) [astro-ph/9508025]; 
490, 493 (1997) [astro-ph/9610188]; T. Fukushige and J. Makino, Astrophys. J. 477, L9 (1997) [astro- 
ph/9610005]; B. Moore et al., Mon. Not. Roy. Astron. Soc. 499, L5 (1998). 

18N. Makino, S. Sasaki, and Y. Suto, Astrophys. J. 497, 555 (1998). Also see Y. Suto, S. Sasaki, and 
M. Makino, Astrophys. J. 509, 544 (1998); E. Komatsu and U. Seljak, Mon. Not. Roy. Astron. Soc. 
327, 1353 (2001). 
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in this book, using gravitational lenses (Section 9.3), the Sunyaev—Zel’dovich 
effect (Section 2.5), and anisotropies in the cosmic microwave background 
(Sections 2.6 and 7.2), the last of which also gives a value for Qgh*. In 
addition to these, there are methods! based on the evolution of clusters 
of galaxies, cosmic flows, cluster correlations, etc., that depend on detailed 
dynamical theories of structure formation. 


1.10 Intergalactic absorption 


Some of the cosmic gas of nuclei and electrons from which the first galaxies 
and clusters of galaxies condensed must be still out there in intergalactic 
space. Atoms or molecules in this gas could be observed through the res- 
onant absorption of light or radio waves from more distant galaxies or 
quasars, but it is believed that most of the gas was ionized by light from 
a first generation of hot massive stars, now long gone, that are sometimes 
called stars of Population III. It now appears that some quasars formed 
before this ionization was complete, giving us the opportunity to observe 
the intergalactic gas through resonant absorption of the light from these 
very distant quasars. 

Let us suppose that an atomic transition in a distant source produces a 
ray of light of frequency vı that leaves the source at time t; and arrives at 
the Earth with frequency vo at time fo. At time ¢ along its journey the light 
will have frequency redshifted to v,a(t1)/a(t), so if the intergalactic medium 
absorbs light of frequency v at a rate (per proper time) A (v, t), and does not 
emit light, then the intensity 7 (t) of the light ray will decrease according to 
the equation 


i= -A (viata, r) I(t). 


But if the intergalactic gas is at a non-zero temperature T (t), then photons 
will also be added to the light ray through the process of stimulated emission, 
as a rate per photon given by the Einstein formula! exp (—hv/kgT) AQ, t), 
so the intensity of the light ray will satisfy 


. hvia(tı) 
y= i — exp (- a A (viata, r) I(t) (1.10.1) 


The intensity observed at the earth will then be 
I(to) = exp(—t)I(t1) , (1.10.2) 


19 For surveys, see N. A. Bahcall, Astrophys. J. 535, 593 (2000) [astro-ph/0001076]; M. Turner, Astro- 
phys. J. 576, L101 (2002) [astro-ph/0106035]; S. Schindler, op. cit.; K. A. Olive, lectures given at Theoret- 
ical Advanced Study Institute on Elementary Particle Physics, Boulder, June 2002 [astro-ph/0301505]. 

1A. Einstein, Phys. Z. 18, 121 (1917). 
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where t is the optical depth: 


= u hvia(tı) 
E [1-2 (areg) | aoao de. 0103 


The absorption rate is given by 
Av, t) =n(tho(), (1.10.4) 


where o (v) is the absorption cross section at frequency v, and n(f) is the 
number density (per proper volume) of absorbing atoms. Often the absorp- 
tion cross section is sharply peaked at some frequency vp, so the absorption 
takes place only close to a time tr, given by 


a(tr) = vja(t1)/vR. (1.10.5) 


Therefore the optical depth can be approximated as 


t ~n(tr) [1 — exp ( = hve/keT (te) | J o (viata) dt . 


By changing the variable of integration from time to frequency, we can write 
this as 


t = n(tR) [i — exp ( 2 hvr/ksT(tr))| [a(tr)/a(tr)| Ze , (1.10.6) 


where 
1 
Tr= = fewa, (1.10.7) 
VR 


the integral being taken over a small range of frequencies containing the 
absorption line. The only thing in the formula for t that depends on a 
cosmological model is the Hubble expansion rate å(tr)/a(tpr) at the time of 
absorption, given by Eq. (1.5.19) and (1.5.38) as 


a(tr) 


nF HoV 2a + QK(1 + zR)? + Qu (1 +R +9R +2R)4, 
R 


(1.10.8) 


where zę = a(to)/a(tr) — 1 = vr/vo — l is the redshift of the location of the 
resonant absorption. For a source at redshift z, the absorption takes place 
over a range of observed frequencies vp = v, /(1 + z) given by the condition 
that the time tr defined by Eq. (1.10.5) should be between ¢; and to: 


vR/(1 +2) < vw <vR. (1.10.9) 
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For example, in 1959 Field? suggested looking for the effects of abs- 
orption of radio frequencies in the 21 cm transition in hydrogen atoms, 
caused in transitions from the spin zero to spin one hyperfine states in the 1s 
state of intergalactic hydrogen. Here vr = 1420 MHz, so the radio spectrum 
of the galaxy Cygnus A at a redshift z = 0.056 should show an absorp- 
tion trough (1.10.9) from 1342 MHz to 1420 MHz. Unfortunately, the 
temperature of neutral hydrogen in intergalactic space is much larger than 
hvr/kg = 0.068K, so the optical depth (1.10.6) is suppressed by a factor 
x 0.068 K/T (tr). No sign of this absorption trough has been discovered. 
It is hoped that in the future a new generation of low frequency radio tel- 
escopes with good angular resolution may be able to use the emission and 
absorption of 21 cm radiation at large redshifts to study both the growth 
of structure and primordial density perturbations from which they grew. 
For instance, by 2010 the Low Frequency Array (LOFAR) should be able to 
study 21 cm radiation from sources at redshift between 5 and 15 with good 
sensitivity and high angular resolution.* 

For the present, a much better probe of intergalactic hydrogen atoms is 
provided by absorption of photons in the Lyman g transition from the 1s 
ground state to the 2p excited state, known as the Gunn—Peterson effect.’ 
This has a resonant frequency in the ultraviolet, ve = 2.47 x 10!5 Hz, 
corresponding to a wavelength 1,215 A, but for a source of redshift z > 1.5 
the lower part or the absorption trough (1.10.9) will be observable on the 
Earth’s surface at wavelengths greater than 3,000 A, in the visible or infrared 
part of the spectrum. Here Ava/kg = 118,000 K, which is likely to be larger 
than the temperature of the intergalactic medium, in which case the factor 


1 — exp ( — hvr/keT(tr)) in Eq. (1.10.6) can be set equal to unity. The 


integral (1.10.7) here has the value 4.5 x 10718 cm?, so Eq. (1.10.6) gives the 
optical depth just above the lower end of the absorption trough (1.10.9) as 


n(tr) 
Tygp=vr/(U4z)+ = (z 7 10o =) (2a + Qg (1 +2) 
3 art? 
HMU ERUH), (1.10.10) 


where again h is Hubble’s constant in units of 100 km s7! Mpc™!. For 
instance, ifa fraction f of the baryons of the universe at a time corresponding 


2G. Field, Astrophys. J. 129, 525 (1959). 

3A. Loeb and M. Zaldarriaga, Phys. Rev. Lett. 92, 211301 (2004) [astro-ph/0312134]; S. Furlanetto, 
S. P. Oh, and F. Briggs, Phys. Rep. 433, 181 (2006) [astro-ph/0608032]. 

4H. J.A. Röttgering et al., in Cosmology, Galaxy Formation, and Astroparticle Physics on the Pathway 
to the SKA, eds. H.-R. Klöckner et al. [astro-ph/0610596]. 

5J. E. Gunn and B. A. Peterson, Astrophys. J. 142, 1633 (1965). Also see I. S. Shklovsky, Astron. Zh. 
41, 408 (1964); P. A. G. Scheuer, Nature 207, 963 (1965). 


77 


1 The Expansion of the Universe 


to z = 5 were in the form of neutral intergalactic hydrogen atoms, and 
Ngh? = 0.02, then the number density of hydrogen atoms at z = 5 would 
be 4.8f x 1075 cm™?. Taking h = 0.65, Qn = 0.7, Qm = 0.3, and 
Qe = Qpr = 0, the optical depth (1.10.10) would be 3.8f x 10°. Thus 
with these parameters intergalactic neutral hydrogen that makes up a frac- 
tion of baryonic matter f >> 2.6 x 1076 would completely block any light 
with a frequency above the redshifted Lyman « line from sources beyond 
z = 5. Evidently the Gunn—Peterson effect provides a very sensitive probe 
of even a small proportion of neutral hydrogen atoms. 

For many years the search for the Lyman « absorption trough was 
unsuccessful. Quasar spectra show numerous Lyman « absorption lines, 
forming what are sometimes called “Lyman a forests,” which are believed 
to arise from clouds of neutral hydrogen atoms along the line of sight, but 
for quasars out to z ~ 5 there was no sign of a general suppression of 
frequencies above the redshifted Lyman a frequency,° that would be pro- 
duced by even a small fraction f of the baryons in the universe in the form 
of neutral intergalactic hydrogen atoms. Then in 2001 the spectrum of the 
quasar SDSSp J103027.10+052455.0 with redshift z = 6.28 discovered by 
the Sloan Digital Sky Survey was found to show clear signs of a complete 
suppression of light in the wavelength range from just below the redshifted 
Lyman a wavelength at 8,845 A down to 8,450 A, indicating a significant 
fraction f of baryons in the form of neutral intergalactic hydrogen atoms at 
redshifts greater than 8,450/1,215 — 1 = 5.95.7 (See Figure 1.8.) Thus a 
redshift of order 6 may mark the end ofa “dark age,” in which the absorption 
of light by neutral hydrogen atoms made the universe opaque to light with 
frequencies above the redshifted Lyman « frequency. Further evidence for 
this conclusion is supplied by the spectrum of intense gamma ray sources, 
known as gamma ray bursters, at large redshifts.® 

This does not mean that all or even most of the hydrogen in the universe 
was in the form of neutral atoms at z > 6. As we have seen, even small con- 
centrations of neutral hydrogen could have produced an absorption trough 
in the spectrum of distant quasars. In fact, we shall see in Chapter 7 that 
there is now some evidence from the study of the cosmic microwave back- 
ground that hydrogen became mostly ionized at redshifts considerably larger 
than z ~ 6, perhaps around z ~ 10. 


6A, Songalia, E. Hu, L. Cowie, and R. McMahon, Astrophys. J. 525, L5 (1999). 

TR. H. Becker et al., Astron. J. 122, 2850 (2001) [astro-ph/0108097]. See S. G. Djorgovski et al., 
Astrophys. J. 560, L5 (2001) [astro-ph/0108069], for a hint of absorption by neutral hydrogen at slightly 
smaller redshifts. Also see X. Fan et al., Astrophys. J. 123, 1247 (2002) [astro-ph/0111184]. 

8T. Totani et al., Publ. Astron. Soc. Pacific 58, 485 (2006) [astro-ph/05121 54]. 
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The clouds of neutral hydrogen at redshifts z < 6 which produce the 
Lyman a forest can provide an independent means of measuring Qq and 
Q4. The idea goes back to a 1979 paper of Alcock and Paczynski.? Suppose 
we observe a luminous object at a redshift z that extends a proper distance 
Dı perpendicular to the line of sight and a proper distance Dj along the 
line of sight. According to the definition of the angular diameter distance, 
the object will subtend an angle 


AO = Di /d4(z). (1.10.11) 


Also, when we observe light from the whole object at the same time fo, the 
difference in the time f, that the light was emitted from the far and near 
points of the object will be At; = Dy. The redshift is a(to)/a(t1) — 1, so the 
absolute value of the difference of redshift from the far and near points of 
the object will be 


_ ato) 
a(t) 


where H(z) = å(tı)/a(tı) is the Hubble constant at the time of emission. 
Taking the ratio, we have 


a(t))At) =(1+2)H(2)D,, (1.10.12) 


A 
a = (1+2 H(z) da(z)(Dj/D.) (1.10.13) 


It is then only necessary to use Eq. (1.5.19) to write H(z) as 


8G K 
H(z) = (3) (omod +2)? + py + pro(l +24) -50 +2? 
0 


= HV 2u (1 +2 + Qa + 2R +244 2A +2)2, (1.10.14) 
and use Eqs. (1.4.12) and (1.5.45) to write d4 (z) as 


1 
1/2 


da2) =———__ 
(1 + 2) HQ}! 


x sinh ay? f u 
K Jia+ XV OA + KX? + Qmx? + Qex-4 
(1.10.15) 


°C. Alcock and B. Paczyński, Nature 281, 358 (1979). 
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The Hubble constant Ho cancels in the product, and we find a result that 
depends only on z, D)/D ,, and the Qs: 


Az 


Z = (Di/D1) 2K? VQ +2 + Qa + ORO 2) + OKC + zy 


x sinh af A 
K Jya x2/Q4 + KX? + Qmx? + Qpx~4 
(1.10.16) 


For instance, if the object is known to be a sphere, such as a spherical cluster 
of galaxies, then D)/D, = 1, and we can use a measurement of Az and A0 
to set a model-independent constraint on the Qs, with no need to worry 
about effects of evolution or intergalactic absorption. 

Unfortunately, it is not so easy to find spherical objects at large redshift. 
But there are various objects whose distribution functions are spherically 
symmetric. For instance, the distribution of field galaxies is presumably 
spherically symmetric about any point in space, and it has been proposed 
that the application of the Alcock—Paczynski method to galaxies might 
allow a determination of the cosmological constant.!? This method has 
been applied!! instead to the distribution of quasars measured in the 2dF 
QSO Redshift Survey.!? Assuming K = 0, this analysis gives Q4 = 
0.7109. 

Recently the Alcock—Paczynski idea has been applied to the distribution 
function of Lyman a clouds.!? As already mentioned, these are intergalactic 
clouds containing neutral hydrogen atoms, which absorb light from more 
distant quasars along the line of sight in 1s — 2p transitions, showing up as 
dark lines in the spectrum of the quasar at wavelengths 1215 (1 + z) A for 
clouds at redshift z. Suppose we measure the number density N (z, ñ) of 
Lyman « clouds at various redshifts z in various directions ñ. Assuming a 
spherically symmetric distribution of Lyman « clouds, the mean value of the 
product of the number densities of these clouds at two nearby points with 
redshifts z and z+ Az (with Az < 1) and directions ħ and i+ An separated 
by a small angle A90 will be a function only of z and the proper distance 
between the points, and will be analytic in the components of the vector 


10W, E, Ballinger, J. A. Peacock, and A. F. Heavens, Mon. Not. Roy. Astron. Soc. 281, 877 (1996). 

‘lp J, Outram et al., Mon. Not. Roy. Astron. Soc. 348, 745 (2004) [astro-ph/03 10873]. 

125, M. Croom et al., Mon. Not. Roy. Astron. Soc. 349, 1397 (2004); available at www. 2d£ 
quasar.org. 

13L, Hui, A. Stebbins, and S. Burles, Astrophys. J. 511, L5 (1999); P. McDonald and J. Miralda- 
Escudeé, Astrophys. J. 518, 24 (1999); W-C. Lin and M. L. Norman, talk at the Theoretical Astrophysics 
in Southern California meeting, Santa Barbara, October 2002 [astro-ph/0211177]; P. McDonald, Astro- 
phys. J. 585, 34 (2003). 
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Figure 1.8: Observed intensity versus wavelength for four high-redshift quasars, from 
R. H. Becker et al., Astron. J. 122, 2850 (2001) [astro-ph/0108097]. Vertical dashed lines 
indicate the redshifted wavelengths for various spectral lines. In the direction of the quasar 
with z = 6.28 the intensity drops to zero within experimental accuracy just to the left of the 
Lyman a line at 8845 Å, a feature not seen for the quasar with z = 5.99, indicating the onset 
of patches of nearly complete ionization at a redshift between 5.99 and 6.28. 
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separating these components, so for small separations it can be written 
2 2 


X A AL IPA Z 
(N(z,n) N (z+ Az, n+ An)) ~ (N*(z,n)) [ LO 


l , (1.10.17) 


where D1 and Dy are given by Eqs. (1.10.11) and (1.10.12), and L is some 
correlation length. This can be written in terms of the observed Az and 
A0, as 


, ee NE l Az? | 
(N(z,n) N(z+ Az,n+ An)) > (N*(z,n)) | 1 : 


PRO LO 
(1.10.18) 

where L- and Le are correlation lengths for redshift and angle 

L(z) 

d4(z) ` 


By measuring this product for various redshifts and directions, we can infer 
a value for the ratio of correlation lengths, which is independent of L: 


Lo) = Lz) =L(z)\1+2)H(z). (1.10.19) 


L. a 
LG = oe” JQud +234+ 2, +9R 4$2)4+ 2x0 +2)? 
o 


5 1/2 dx 
x sinh | Q / i 
l i 1/(1+z) x2,/QA + Qex72+Qyx3 + | 
(1.10.20) 


This method has been applied!* to five pairs of close quasars, with red- 
shifts in the range from 2.5 to 3.5 and separations ranging from 33 to 
180 arcseconds. Use of this limited sample sets only weak constraints on 
the Qs, but it rules out Q4 = 0 at the level of 2 standard deviations. 


1.11 Number counts 


A uniform distribution of sources with a smooth distribution of absolute 
luminosity leads in ordinary Euclidean space to a unique distribution in 
apparent luminosity. If there are N(L)d~ sources per unit volume with 
absolute luminosity between L and L + dL, then the number n(> £) of 


14A. Lidz, L. Hui, A. P. S. Crotts, and M. Zaldarriaga, astro-ph/0309204 (unpublished). 
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sources observed with apparent luminosity greater than £ is given by 


JL/Ant 
Í Amr? dr 


n(> £) = f Noar 
0 


1 [0,0] 
= — L3/? N(L) dL (1.11.1) 
3/4 03/2 I 
Thus whatever the distribution in absolute luminosity, we expect that 
n(> £) x 73/2, 
This analysis needs several changes in a cosmological setting: 


0 


1. Instead of the volume element r° sin 6 dr d do, the proper volume 
element here is (Det g®)!/2dr d0 dø, where a = ag is the 


three-dimensional metric, with non-vanishing components gO = 
a’/(1 — Kr’), rao = ar’, gi} = ar? sin? 0, so 


E @ (t)r? sin@ dr dé do 


dV (1.11.2) 
V1 — Kr? 
2. The apparent luminosity is related to the absolute luminosity by 
L 
Cs (1.11.3) 
Amd; (z) 


where dg (z) is the luminosity distance (1.4.3). 


3. Except in the steady state cosmology, the number density of sources 
changes with time, even if only through the cosmic expansion. 


4. We can often measure the redshift z as well as the apparent lumino- 
sity. 


Eq. (1.11.2) gives the number of sources with redshift between z and 
z + dz and apparent luminosity between £ and £ + dé as 
a>(t) r2 dr 


— Kr2 


n(z, © dz dt = 4nN(t, L)dL (1.11.4) 


where N (t, L) dL is the number of sources per proper volume at time t 
with absolute luminosity between L and L + dL; t and z are related by 
1+ z= a(to)/a(t), and t and r are related by Eq. (1.2.2): 


to dt! r dr’ 
= am 1.11.5 
f a(t’) Í 1 — Kr’? Á ) 
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We use (1.11.5) to express the differential dr in terms of dt, and then express 
dt in terms of dz: 


dr = dts az 
VIZEK? a(t) Hew’ 
where H(z) = å(t)/a(t) and ag = a(tọ). As a reminder, for a universe 


containing radiation, matter, and a constant vacuum energy, Eq. (1.5.41) 
gives 


H(z) = HoV Qa + Qk +2)? + Qu +z) + QR +z). 
Canceling dz in Eq. (1.11.4), we then have 
Ar N(1©,L) Pag dL 
(d +z)? A(z) 
We next use Eq. (1.11.3) to write (with z now held fixed): 


n(z, £) de = 


> 


dL = 4nd;7(z) dt, 
so that canceling dé gives 


16? N (16), 4rd? E£) dE) 
HE) (+2) 


n(z, £) = ‘ (1.11.6) 
in which we have used Eq. (1.4.3) to express aor in terms of dz. 

In particular, for a sample of sources that are not evolving at a time 
t(z), the time dependence of the number density M is just proportional to 
a> x (1 +z): 


N(t@),L) =e NGC: (1.11.7) 
If all members of this sample are bright enough to be visible at a redshift 


z, then the total number of sources observed with redshifts between z and 
z + dz will be n(z) dz, where 


a is _ 4a Nod} 2) 
where dg (z) is given by Eq. (1.5.45), and 
No =| No(L) dL . (1.11.9) 
0 
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In principle, even without knowing No or Ho, if n(z) were accurately 
measured we could compare the observed shape of this function with 
Eq. (1.11.8) to find the Qs. 

There are several obvious dangers in using Eq. (1.11.8) in this way. For 
one thing, it is necessary to avoid missing sources that have high redshift and 
hence low apparent luminosity. Also, evolution in the number of sources 
can introduce an additional dependence on the light emission time ¢, and 
hence on z. In 1986 Loh and Spillar! carried out a survey of galaxy numbers 
as a function of redshift. The redshifts were measured photometrically (1. e., 
from their luminosities at various colors rather than by the shift of specific 
spectral lines), which generally gives less reliable results. Comparing their 
results with Eq. (1.11.8) in the case Qg = Qr = 0(so that Za + Qy = 1), 
they found that Q,/Qy = 0.17 o A By now it has been realized that the 
evolution of sources cannot be feclested at redshifts large enough for n(z) 
to be sensitive to cosmological parameters, and this result for Qa /Qy has 
been abandoned.” 

Useful results can be obtained when evolution is taken into account. 
One group? used number counts of very faint galaxies* as a function of 
apparent luminosity to estimate the free parameters in a model of galac- 
tic luminosity evolution (assuming the number of galaxies per coordinate 
volume to be constant), and then used this model together with a redshift 
survey extending to z ~ 0.47 to conclude that Q yy is small and that Q, isin 
the range of 0.5 to 1. More recently, several surveys? of numbers of galaxies 
at different redshifts that yield important results about galactic evolution, 
and with the use of dynamical models they can yield information about Q m 
and Q,.’ But it appears that number counts of galaxies will be more useful 
in learning about galactic evolution than in making precise determinations 
of cosmological parameters. In a dramatic application of this approach,® a 


lE, D. Loh, Phys. Rev. Lett. 57, 2865 (1986); E. D. Loh and E. J. Spillar, Astrophys. J. 284, 439 
(1986). 

2For a discussion of future prospects for measuring Q, in redshift surveys, see W. E. Ballinger, J. A. 
Peacock, and A. F. Heavens, Mon. Not. Roy. Astron. Soc. 282, 877 (1996). 

3M. Fukugita, F. Takahara, K. Yamashita, and Y. Yoshii, Astrophys. J. 361, L1 (1990). 

4J, A. Tyson, Astron. J. 96, 1 (1988). 

ST. J. Broadhurst, R. S. Ellis, and T. Shanks, Mon. Not. Roy. Astron. Soc. 235, 827 (1988). 

6G, Efstathiou, R. S. Ellis, B. A. Peterson, Mon. Not. Roy. Astron. Soc. 232, 431 (1988); J. 
Loveday, B. A. Peterson, G. Efstathiou, and S. J. Maddox, Astrophys. J. 390, 338 (1992); L. da Costa, 
in Proceedings of the Conference on Evolution of Large Scale Structure, Garching, August 1998 [astro- 
ph/9812258]; S. Borgani, P. Rosati, P. Tozzi, and C. Norman, Astrophys. J. 517, 40 (1999) [astro- 
ph/9901017]; S. J. Oliver, in Highlights of the ISO Mission: Special Scientific Session of the IAU General 
Assembly. eds. D. Lemke et al. (Kluwer) [astro-ph/9901272]; M. Colless, in Publ. Astron. Soc. 
Australia [astro-ph/99 11326]; S. Rawlings, astro-ph/0008067. 

TW. J. Percival et al., Mon. Not. Roy. Astron. Soc. 327, 1297 (2001) [astro-ph/0105252]; S. Borgnani 
et al., Astrophys. J. 561, 13 (2001) [astro-ph/0106428]. 

SR. J. Bouwens and G. D. Illingworth, Nature 443, 189 (2006). 
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search at the Lick Observatory for galaxies with redshifts in the range z ~ 
7 to 8 found at most just one galaxy, while it is estimated that if Eq. (1.11.7) 
were valid then, on the basis of the number of galaxies observed (with the 
same conservative selection criteria) at redshifts z ~ 6, ten galaxies should 
have been found with z ~ 7 to 8. The implication is that there must have 
been a spurt in the formation of luminous galaxies at a redshift in the range 
6 to 7. This fits in well with the conclusion discussed in Section 1.10, that 
the ionization of intergalactic hydrogen became essentially complete at a 
redshift of order 6, presumably due to ultraviolet radiation from massive 
stars formed around that time. 


Historically the first important application of number counts was in radio 
source surveys, where redshifts are not generally available. These surveys 
take place at a fixed receiving frequency v, corresponding to a variable 
emitted frequency v(1+z), so the source counts are affected by the frequency 
dependence of the distribution of intrinsic source powers. 

If a source with a redshift z emits a power? P(v)dv between frequencies 
v and v + dv, then the power received at the origin per unit antenna area 
between frequencies v and v + dv is 


P(vd s z)) dv(1+z) 


S(v)dv = 1.11.10 
o 4nd? (z) l ) 

Many radio sources have a “straight” spectrum, i.e. 
Pv) xv“ (1.11.11) 


with the spectral index a typically about 0.7 to 0.8. This allows a great 
simplification in Eq. (1.11.10): 
P(v)dv 


SUEY ea 


(1.11.12) 


From now on we will take the observed frequency v as fixed, and write 
S(v) = S and P(v) = P. Canceling dv, Eq. (1.11.12) then reads 


P 
S= . 
4r d? (z) (1 +2)¢! 


(1.11.13) 


°In G&C, P was defined as the power emitted per solid angle, while here it is the power emitted in 
all directions. 
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If at time ¢ there are N (P, t) dP sources per proper volume with power 
between P and P + dP, then the number of sources observed with power 
per antenna area greater than S is 


Asnr2a3(t) dr 
VIZ Kr2 ” 


with the upper limit on the integral over r set by the condition that 


n(> S= aP f Nen (1.11.14) 
0 


P 
apr? (1 +2)+ < re (1.11.15) 


Of course, r, z, and ¢ are related by the familiar formulas 
| r dr | 0 dt’ 
0 f/1—Kr2 Jt att’)’ 


This becomes much simpler if we assume that the time-dependence of 
the source number density can be parameterized as 


1+z=a(to)/a(t). (1.11.16) 


N(P, t) = N(P) ee (1.11.17) 


For instance, if sources do not evolve and are neither created nor destroyed, 
then 6 = —3, while in the steady-state model 8 = 0. Eq. (1.11.14) now 
reads 
Amr2(1 + z)~8-3 dr 
V1 — Kr 
with the same P/S-dependent upper limit (1.11.15) on r. 
The coordinate r is given in terms of z by the power series (1.4.8) 


n(> S =a [Nmap (1.11.18) 
0 


aoHor =z — Ł(1 + q0)? +... . (1.11.19) 
We can then convert the integral over r to one over z, with 
ayHo dr = dz[1 — (1 + qo)z+...], (1.11.20) 
and the upper limit on z is given by 
z [1 +z(œ&— qo) +...] < PH) 
4r S 
or, in other words, 
z< PR Ee qo) i ] (1.11.21) 
4r S 2 mS 
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Then Eq. (1.11.18) becomes 


4 CO 
n> 8) == f N(P) dP 
Ag 0 
esta aaa OF 3 ae — gq) (PHB LA 
x| = | — — (a — — a 
3\4ns a Das 


1 [PRV as 
A aes (E +5+2q0) +... |, 


or, collecting terms, 


1 
3/4 S3/2 


1/2 
3 PH 
x[i- 9 (5+6 +22) (=) +l (1.11.22) 


We see that n(> S) has a term with the familiar S7?/ dependence, 
plus a correction proportional to S7? with a coefficient proportional to 
5+ 6+ 2a. It is noteworthy that this coefficient is independent of go or 
K. For the standard cosmology with no evolution of sources 6 = —3, and 
we have mentioned that a ~ 0.75, so 5+ 6 + 2a = 3.5. Although the 
precise value is uncertain, this coefficient is definitely positive, which means 
that for faint sources n(> S) should fall off more slowly than S~?/?, This 
is definitely not what is observed.!° It has been known for many years that 
for S > 5 x 10-7°Wm~?/Hz, the source count function N(> S) falls off 
more rapidly than S~?/*. The conclusion is inevitable that the number of 
radio sources per co-moving volume is decreasing, with 6 < —6.5. Radio 
source counts are useful in studying this evolution, but not for measuring 
cosmological parameters. 

On the other hand, for the steady state cosmology (discussed in 
Section 1.5) we have 6 = 0, so the coefficient 5+ 6 + 2a ~ 6.5, and 
the predicted number count N(> S) decreases even more slowly with S, 
making the disagreement with experiment even worse than for the standard 
cosmology with no evolution of sources. Here it is not possible to save 
the situation by appealing to evolution, because the essence of the steady 
state model is that on the average there is no evolution. This observation 


n(> S) = | PENE dP 
0 


10For a list of major radio source surveys, and references to the original literature, see G&C, Sec. 14.8. 
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discredited the steady state model even before the discovery of the cosmic 
microwave radiation background. 


1.12 Quintessence 


So far, we have taken into account only non-relativistic matter, radiation, 
and a constant vacuum energy in calculating the rate of expansion of the 
universe. It appears that the vacuum energy is not only much smaller than 
would be expected from order-of-magnitude estimates based on the quan- 
tum theory of fields, but is only a few times greater than the present matter 
density. This has led to a widespread speculation that the vacuum energy 
is not in fact constant; it may now be small because the universe is old. A 
time-varying vacuum energy is sometimes called quintessence. ! 

The natural way to introduce a varying vacuum energy is to assume the 
existence of one or more scalar fields, on which the vacuum energy depends, 
and whose cosmic expectation values change with time. Scalar fields of this 
sort play a crucial part in the modern theory of weak and electromagnetic 
interactions, and are also introduced in theories of inflation, as discussed in 
Chapters 4 and 10. 

For simplicity, let us consider a single real scalar field g(x, t). We will 
be concerned here with fields that are vary little on elementary particle 
spacetime scales, so the action of these field is taken to have a minimum 
number of spacetime derivatives: 


1 ,, a9 a 
I=- f ats Dae 5028 8 + ve] (1.12.1) 


Ox* əx“ 


with an unspecified potential function V(g). We are interested here in the 
case of a Robertson—Walker metric, and a scalar field that depends only on 
time, not position. In this case the formulas (B.66) and (B.67) for the scalar 
field energy density and pressure become 


Bs 

po = 59 + VV) (1.12.2) 
bs 

Po = 5% — VY). (1.12.3) 


It follows immediately that (1 + w)py > 0, where w = py/pg, so as long as 
Py = 0 this model has w > —1, and the phantom energy disaster discussed 
in Section 1.6 does not occur. 


lFor reviews with references to the original literature, see B. Ratra and P. J. E. Peebles, Rev. Mod. 
Phys. 75, 559 (2003); E. V. Linder, 0704.2064. 
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The equation (1.1.32) of energy conservation here reads 
6+3HG+V'(y) =0, (1.12.4) 


(where as usual H(t) = a(t)/a(t)), which is the same as the field equation 
derived from the action (1.12.1). This is the equation of a particle of unit 
mass with one-dimensional coordinate g, moving in a potential V(g) with 
a frictional force —3H@. The field will run toward lower values of V (9), 
finally coming to rest if it can reach any field value where V (ọ) is at least a 
local minimum. Unfortunately, we do not know any reason why the value 
of V (g) where it is stationary should be small. 

Nevertheless, there are potentials that have some attractive properties 
once we adjust an additive constant in the potential to make them vanish 
at their stationary point. The original and simplest example is provided by 
a potential? 


Vo) = M 9% , (1.12.5) 


where a is positive but otherwise arbitrary, and M is a constant with the 
units of mass (taking A = c = 1), which gives V (ọ) the dimensions of 
an energy density. There is no special reason to believe that the potential 
has this form, and in particular there is no known reason for excluding 
an additive constant (including effects of quantum fluctuations in all other 
fields), which would give the potential a non-zero value at its stationary 
point, at g = oo. Nevertheless, it may be illuminating to work out the 
consequences of this one specific model of quintessence. 

For any potential it is necessary to assume that at sufficiently early times 
Py was much less than the energy density pr of radiation because, as we 
will see in Section 3.2, any appreciable increase in the energy density at the 
time of cosmological nucleosynthesis would lead to a helium abundance 
exceeding what is observed. At these early times the energy density of 
radiation (including particles like neutrinos with masses less than kgT) is 
also greater than that of non-relativistic matter, so Eq. (1.5.34) gives a(t) « 
t!/?, and therefore H = 1/2t. The field equation (1.12.4) with potential 
(1.12.5) then reads 


3 
öğ + x? — aM g! =0. (1.12.6) 


2P, J, E. Peebles and B. Ratra, Astrophys. J. 325, L17 (1988); B. Ratra and P. J. E. Peebles, Phys. Rev. 
D 37, 3406 (1988); C. Wetterich, Nucl. Phys. B302, 668 (1988). Quintessence models with this potential 
were intensively studied by I. Zlatev, L. Wang, and P. J. Steinhardt, Phys. Rev. Lett. 82, 896 (1999); P. J. 
Steinhardt, L. Wang, and I. Zlatev, Phys. Rev. D 59, 123504 (1999). 
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This has a solution 


1.12.7 
6+a ( ) 


1 
(2 + eue) 2+a 

Qz= A 
Both ġ? and V (g) then go as t7?*/C+% and therefore at very early times py 
must have been less than pr, which goes as t~2. This solution is not unique, 
but it is an attractor, in the sense that any other solution that comes close to 
it will approach it as ¢ increases. (To see this, note that a small perturbation 
ôg of the solution (1.12.7) will satisfy 


Sige SOs aS) 
uY (2+.a)202 


wa eek = 2 
0= 66+ stal +o)M iy e236 = 86+ 


This has two independent solutions of the form 


gxt, por 


Ka 1 (6+a)(+a) 
4 16 (2+ a)? 

The square root is imaginary for a > 0, so both solutions for dg decay as 
t—'/4 for increasing t, while itself is increasing.) For this reason, the 
particular solution of Eq. (1.12.6) that goes as Eq. (1.12.7) for t > 0 
is known as the tracker solution. There is no particular physical reason 
to require that the initial conditions for the scalar field are such that the 
scalar field has approached the tracker solution by the present moment 
(the set of such initial conditions is called the “basin of attraction”), but 
since this requirement would make the present evolution of the scalar field 
insensitive to the initial conditions, it has the practical advantage of pro- 
viding a model of quintessence with just two free parameters: M 
and g. 

Nothing much changes when the radiation energy density drops below 
the energy density of non-relativistic matter. The tracker solution for the 
scalar field continues to grow as ¢?/?+ (though with a different constant 
factor), so ¢” and V (ọ) continue to fall as 1 ?®/C+%)., But om and pp are 
decreasing faster, like t~? and 178/3, respectively, so eventually pm and pr 
will fall below py. Itis interesting that the value of g where py becomes equal 
to py is independent of the unknown constant M. When the expansion is 
dominated by matter pm is given by Eq. (1.5.31) as 1/6 Gr’, while (1.1.2), 
(1.12.5) and (1.12.7) give py ~ M?4+%/C+a) ;—20/2+2) s0 the time te at 
which py = pm is of order 


tee MOOR G Crit | (1.12.8) 
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Using this in Eq. (1.12.7) then gives 
olt) x GT. (1.12.9) 
Once pm falls well below py, the equation of motion of g(t) becomes 
G+ /24nGpy¢ —aM***o-*"! =0, (1.12.10) 


with py given by Eq. (1.12.2). The tracker solution in this era has a 
complicated time dependence, but it becomes simple again at sufficiently 
late times, times that may be later than the present. We can guess that the 
damping term proportional to ¢ in this equation will eventually slow the 
growth of ọ, so that ¢* will become less than V(y), and also guess that 
the inertial term proportional to @ will become negligible compared to the 
damping and potential terms. (Similar “slow roll” conditions will play an 
important role in the theory of inflation, described in Chapters 4 and 10.) 
Equation (1.12.10) then becomes 


24r GM4+¢g-¢ ġ = Mtg! , 


aM 2+2/2g-a/2-1 


g= 
v 241 G 


and so 


(1.12.11) 
The solution is 


2 2 t 1/2+a/2) 
p=m(* TULI ) l (1.12.12) 


~v 24r G 


(In general this involves a redefinition of the zero of time, to avoid a 
possible integration constant that might be added to t.) We can now 
check the approximations used in deriving Eq. (1.12.11), of which this is the 
solution. From Eq. (1.12.12) we see that ġ? « 17 C+”/C+/?) while V (p) « 
t~¢/+4/2) sq the kinetic energy term in Eq. (1.12.2) does become small 
compared with the potential term at late times. Also, ¢ « 17 8t%)/@C+a/2) 
while V’(y) x t~(+%/C2+e@/2) so the inertial term in Eq. (1.12.10) does 
become small compared with the potential term at late times. Eq. (1.12.12) is 
therefore a valid asymptotic solution of Eq. (1.12.10) for t + oo. Numerical 
calculations show that it is not only a solution for t > ov; itis the asymptotic 
form approached for t —> oo by the tracker solution. 

With py « t~*/?+¢/2) dominating the expansion rate at late times, we 
have a/a x t~%/22+4/2), so 


Ina x P/C+e/?) | (1.12.13) 
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This is a similar but less rapid growth of a than would be produced by 
a cosmological constant, for which Ina œ t. The difference between the 
deceleration parameter go and the value —1 for an expansion dominated 
by a cosmological constant vanishes as t~?+®/2+@/2)_ Note that the radi- 
ation and matter densities decrease as 1/a* and 1/a? respectively, and the 
curvature decreases as | / a’, all of which have a much faster rate of decrease 
with time than the power-law decrease of py, so the expansion rate is indeed 
dominated by py at late times, justifying the derivation of Eq. (1.12.10). 

We have found that, at least for a range of initial conditions, the potential 
(1.12.5) leads to an expansion that is dominated by radiation and then matter 
at early times, but becomes dominated by the scalar field energy at late times. 
But to get agreement with observation it is necessary arbitrarily to exclude 
a large constant term that might be added to (1.12.5), and also to adjust 
the value of M to make the critical time (1.12.8) at which the values of 
Py and py cross be close to the present moment to ~ 1/Ho. Specifically, 
Eq. (1.12.8) shows that we need the constant factor in V (ọ) to take the 
value 


Mitte x G! He . (1.12.14) 


There is no known reason why this should be the case. 

Several groups of observers are now planning programs to discover 
whether the vacuum energy density is constant, as in the case of a cos- 
mological constant, or changing with time. In such programs, one would 
compare the observed luminosity distance (or angular diameter distance) 
with a formula obtained by replacing the term Q, in the argument of the 
square root in Eq. (1.5.45) with a time-varying dark energy term. These 
observations will not actually measure the value wo of w at the present time, 
much less the present time derivatives wo, Wo, etc., because for that pur- 
pose it would be necessary to have extremely precise measurements of the 
luminosity distance or angular-diameter distance for small redshifts. Ins- 
tead, measurements will be made with only moderate precision, but over a 
fairly large range of redshifts. To compare such measurements with theory, 
one needs a model of the time-variation of the dark energy. One model is 
simply to assume that w is constant, or perhaps varying linearly with time 
or redshift, but there is no physical model that entails such behavior.’ It 
seems preferable to compare observation with the model of a scalar field 
rolling down a potential, which (whatever reservations may have about its 
naturalness) at least provides a physically possible model of varying dark 


3 Other assumptions about the form of w as a function of redshift that can mimic scalar field models 
have been considered by J. Weller and A. Albrecht, Phys. Rev. D 65, 103512 (2002); E. V. Linder, Phys. 
Rev. Lett. 90, 091301 (2003) [astro-ph/0208512]. 
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energy.* Because these observations are difficult, it pays to adopt scalar field 
models with just two parameters, which can if we like be expressed in terms 
of Qa = 1 — Qy (assuming flatness and neglecting the radiation energy 
density) and wo. 

One possibility is to suppose that over the latest e-folding of cosmic 
expansion the scalar field g has taken values for which V (ọ) is only slowly 
varying. If V(~) were constant, we would have a constant vacuum energy, 
with w = —1, and the only parameter to measure would be Qy. For a 
two-parameter fit, we can take V (ọ) to vary linearly with ¢: 


Vy) = Vo + (o-w) Vi. (1.12.15) 


This is valid if the fractional change in V’(g) in a time interval of order 
1/Hp is small; that is, if |V ġo] « Hol Vgl. 

The field equation (1.12.4) for g(t) can be put in a convenient dimension- 
less form by replacing the dependent variable ¢ and independent variable gy 
with dimensionless variables x and w, defined by 


82 GV (Q) 
x= HW 2Mmt, —. 1.12.16 
ov Qu w 32u H? ( ) 


Because V is linear in y, we have 
_ 3QyHj® 32W He do 


© 8aGV 8G; dx ` 
Then Eq. (1.12.4) becomes 


d*w dw 
— +3H—+A=0, 1.12.17 
qo gee ( ) 
where à is the dimensionless parameter 
82 GV? 
3HEQ%, 
and H is a function of w and dw/dx: 
H g d+ +045 ) T (1.12.19) 
~ eaae Z (40) mae Wee $ . . 
Hov 2m dx 


4This approach is followed by D. Huterer and H. V. Peiris, Phys. Rev. D 75, 083502 (2007) [astro- 
ph/0610427]; R. Crittenden, E. Majerotto, and F. Piazza, astro-ph/0702003. 


94 


1.12 Quintessence 


We will also need the differential equation for the redshift: 
— =—-H(1+2). (1.12.20) 


In general, even if we wrote all derivatives with respect to x in terms of 
derivatives with respect to z, to solve these equations we would need initial 
conditions for w and dw/dz at some initial z, which with A would give a three- 
parameter set of solutions. However, assuming that for large redshift the 
energy density is dominated by matter rather than vacuum energy (which as 
we shall see is the case), the derivative dw/dx sufficiently late in the matter- 
dominated era becomes quite insensitive to initial conditions.” For z > 1, 
Eq. (1.12.19) gives 


H > 142)". (1.12.21) 
and (1.12.17) and (1.12.20) then have the solution 
3x\ 78 da Ax 
l — — -—. 1.12.22 
TRT ( 2 ) í ds 3 ( ) 


(An integration constant in the solution for z has been absorbed into the 
definition of x, setting the zero of time. An integration constant in the 
solution for dw/dx has been dropped, because it gives a term in dw/dx that 
dies away with increasing time as x~? œ t72.) The free parameters in our 
solution are then à, together with the value of w at some arbitrary initial 
value x; of x, taken sufficiently small so that at x; the energy density is 
dominated by matter rather than vacuum energy. (Note that the constant 
Vo appears nowhere in these equations; it contributes a term to w(x), but 
there is no need to isolate this term.) One must adopt various trial values 
of à and @(x1); use Eq. (1.12.22) to calculate 1 + z and dw/dx at x = x1; 
with these initial conditions, integrate the differential equations (1.12.17) 
and (1.12.20) numerically from x; to a value xo where z = 0; and then if we 
like calculate the values of Qy = 1 — Qm and the present value wo of the 
ratio py/ py for this particular solution,® using 


Qa hee ( 2 (d/d, — 240 (x0) 

—— =o\x As i cyoo > W= ` 

Om ATANDE T dod) a O 
(1.12.23) 


5R. Cahn, private communication. Cahn has also shown that the approximation of neglecting the 
second derivative term in the field equation does not work well in this context. 

As already mentioned, with models of this sort one can only have wọ > —1. To compare the case 
wo < —1 with observation, it is necessary to adopt a model with the opposite sign for the derivative 
term in the action (1.12.1). The analysis given here can then be applied, with only obvious sign changes 
here and there. 
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The ratio of the dark energy at a given time to its value at the 
present is 


_ pvt) _ dea(x)/dx)* + 22o(x) 
~ py(to) — (dw/dx)2_,,, + 20(Xx0) 


(1.12.24) 


For instance, if we take Q, = 1 — Qy = 0.76 and wo = —0.777, the ratio 
& of the dark energy density to its value at present rises to 1.273 at z = 1 
and to 1.340 at infinite redshift.’ The leveling off of &(z) for large z occurs 
because the growth of the matter density for increasing redshift makes the 
expansion rate grow, so that the friction term 3H@ in Eq. (1.12.4) freezes 
the value of the scalar field at early times. 

It should not be thought that the leveling off of the dark energy for large 
z for the potential (1.12.15) means that in analyzing dark energy obser- 
vations with this potential one must give up the idea motivating theories 
of quintessence, that the vacuum energy is now small because the uni- 
verse is old. In fact, for the potential V(g) x gw, for typical initial 
conditions the quintessence energy drops at first precipitously, and then 
levels off while the scalar field rolls slowly down the potential until the 
field approaches the tracker solution, with the tracker solution not reached 
by the present time if œ is small. The condition [Vij pol K Hol|V | for 
treating this potential as linear over a time of order 1/Hp is satisfied if 
a(l +a)¢9° « 87G, which in light of Eq. (1.12.9) is likely to be satisfied if 
a<l. 

Another possible two-parameter model is provided by the same poten- 
tial, V (p) x g~%, but now under the assumption that the tracker solution 
is reached by some early time (say, for z < 10) in the matter-dominated era. 
With this assumption the observable history of dark energy is insensitive 
to initial conditions, so the model has just two parameters: M and a. The 
equations of this model can be put in dimensionless form by writing the 
coupling constant of this potential in terms of a dimensionless parameter 
Bas 

M = B Qu He (81G) 1-2? (1.12.25) 


and replacing the dependent variable t and independent variable g with 
dimensionless variables x and f, defined by 


t = x/ HoV Qm , y(t) =f (x)/V8xG . (1.12.26) 


7Numerical results for various values of redshift are given in Table 1.1. These results for the linear 
potential were calculated by R. Cahn, private communication. 
8Steinhardt, Wang, and Zlatev, ref. 2. 
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The field equation (1.12.4) (with no slow roll approximation) in the era 
dominated by matter and vacuum energy then takes the form 


2 
S + sn” — apf 7! = (1.12.27) 


where 


n=nNaum= ji (L) +b anw 


in which we also need 
— =-H(1 +2). (1.12.29) 
X 


Because the large z solution (1.12.7) is an attractor, the initial conditions 
introduce no new free parameters; in terms of these dimensionless variables, 
the initial conditions are that, for x > 0, 


2.27 1/a+2) 2/3 
a 


We need to integrate the equations (1.12.27) and (1.12.29) from some small 
x (say, x = 0.01) to a value xo at which z = 0, with the initial conditions 
(1.12.30), and then evaluate Qm = 1— Qa from the condition that H(x9) = 
1/V2m. We can also evaluate the present value wo of w = pg/py from the 
formula 


f? (xo)f (x0)*/2B — 1 
Sf (xo)f (x0)*/2B +1 


and then replace the parameters a and 6 with Qj and wo. For instance, 
if we arbitrarily take a = 1, then to get the realistic value Qyy = 0.24 we 
must take 6 = 9.93, in which case wọ = —0.777. Of course, we can get any 
other values of wo greater than —1 by choosing different values of a and 
re-adjusting 6 to give the same value of Quy (though for small a, the 
range of initial conditions that allow the tracker solution to be reached 
well before the present is relatively small.) For instance, fora = 1/2 we 
must take 6 = 7.82 to have Qy = 0.24, and in this case we calculate that 


wọ = (1.12.31) 


wo = —0.87. (For the case w < —1, see footnote 6.) The ratios of dark 
energy to its value at present calculated in this way for Qm = 0.24 and 
wọ = —0.777 are shown in Table 1.1, along with the values calculated with 


the same choice of Qy and wo for both the case of constant w and for 
the linear potential (1.12.15). The tracker and linear models evidently rep- 
resent opposite extreme assumptions about the time-dependence of dark 
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Table 1.1: Ratio of dark energy to its present value, for the tracker solution with the potential 
(1.12.5), and for the linear potential (1.12.15), calculated for Qy = 1—Q, = 0.24 and 


wo = —0.777, compared with the results for a constant w = —0.777. 

Z tracker linear constant w 
0 1 1 1 

0.1 1.067 1.062 1.066 
0.5 1.347 1.200 1.312 

1 1.712 1.273 1.590 

2 2.469 1.318 2.086 

3 3.224 1.331 2.528 
>1 >1 1.340 > 1 


energy, but both are better motivated physically than the assumption of a 
constant w. 


1.13 Horizons 


Modern cosmological theories can exhibit horizons of two different types, 
which limit the distances at which past events can be observed or at which it 
will ever be possible to observe future events. These are called by Rindler! 
particle horizons and event horizons, respectively. 

According to Eq. (1.2.2), if the big bang started at a time t = 0, then 
the greatest value rmax(t) of the Robertson—Walker radial coordinate from 
which an observer at time ¢ will be able to receive signals traveling at the 
speed of light is given by the condition 


t / Tmax (t) 
Í u = mA (1.13.1) 
o a(t’) 0 V1 — Kr? 


Thus there is a particle horizon unless the integral f dt/a(t) does not con- 
verge at t = 0. It does converge in conventional cosmological theories; 
whatever the contribution of matter or vacuum energy at the present, it is 
likely that the energy density will be dominated by radiation at early times, 
in which case a(t) « t!/, and the integral converges. The proper distance 


1W. Rindler, Mon. Not. Roy. Astron. Soc. 116, 663 (1956). 
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of the horizon is given by Eq. (1.1.15) and (1.13.1) as 


Tmax (2) 


dmax(t) = an | (1.13.2) 
(0) 


V1 — Kr — Kr? ee 


For instance, during the radiation-dominated era a(t) « t!/*, so dmax(t) = 
2t = 1/H. Well into the matter-dominated era most of the integral over 
time in Eq. (1.13.1) comes from a time when a œ #?/3, so that dmax(t) > 
3t = 2/H. At present most of the integral over t’ comes from a period when 
the expansion is dominated by matter and the vacuum energy, and perhaps 
curvature as well. According to Eq. (1.5.41), the particle horizon distance 
at present is 


1 l dx 
dmax(to) = / - (1.13.3) 
ae Ho Jo x2./Qa + Kx? + Qy x3 


We will seein Chapter 4 that there may have been a time before the radiation- 
dominated era in which there was nothing in the universe but vacuum energy, 
in which case the particle horizon distance would actually be infinite. But as 
far as telescopic observations are concerned, Eq. (1.13.3) gives the proper 
distance beyond which we cannot now see. 

Just as there are past events that we cannot now see, there may be events 
that we never will see. Again returning to Eq. (1.2.2), if the universe re-coll- 
apses at a time T, then the greatest value rmax ofr from which an observer 
will be able to receive signals traveling at the speed of light emitted at any 
time later than ¢ is given by the condition 


T TMAX (2) 
J Eae | ES (1.13.4) 
, a(t’) 0 V1 — Kr? 


Even if the future is infinite, if the integral f dt/a(t) converges at t = oo 
there will be an event horizon given by 


oe) TMAX (2) 
aay et ue 
1 at’) Jo V1 — Kr? 


Since co-moving sources are labeled with a fixed value of r, the condition 
r < rMAx limits the events occurring at time t that we can ever observe. In 
the case where the universe does not recollapse, the proper distance to the 
event horizon is given by 


Fi eae Mes. aaa 
max() = ato) | ss = 0 | at’) ” (1.13.6) 
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In the absence of a cosmological constant, a(t) grows like 17/3, and the 
integral diverges, so that there is no event horizon. But with a cosmological 
constant a(t) will eventually grow as exp(Ht) with H = HQ constant, 
and there really is an event horizon, which approaches the value dyax (00) = 
1/H. As time passes all sources of light outside our gravitationally bound 
Local Group will move beyond this distance, and become unobservable. The 
same is true for the quintessence theory described in the previous section. 
In that case a(t) eventually grows as exp(constant x t7/?+@/?)), so for any 
a > 0 the integral (1.13.6) again converges. 

If a source is at a radial coordinate r in a Robertson—Walker coordinate 
system based on us, then we are at a radial coordinate r in a Robertson— 
Walker coordinate system based on the source. Hence Eq. (1.13.4) or 
(1.13.5) also gives the greatest radial coordinate to which, starting at time 
t, we will ever be able to travel. 
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The Cosmic Microwave Radiation Background 


Before the mid-1960s by far the greatest part of our information about the 
structure and evolution of the universe came from observations of the red- 
shifts and distances of distant galaxies, discussed in the previous chapter. In 
1965 a nearly isotropic background of microwave radiation was discovered, 
which has provided a wealth of new cosmological data. After reviewing 
the expectations and discovery of this radiation, this chapter will explore 
some of its implications. We will only be able to give a first look at the 
anisotropies in this radiation in this chapter. In Chapter 7 we will return to 
this very important topic, applying the analysis of the evolution of cosm- 
ological perturbations presented in Chapters 5 and 6, and in Chapter 10 we 
will consider the origin of these perturbations in the very early universe. 


2.1 Expectations and discovery of the 
microwave background 


The work done by pressure in an expanding fluid uses heat energy drawn 
from the fluid. The universe is expanding, so we expect that in the past 
matter was hotter as well as denser than at present. If we look far enough 
backward in time we come to an era when it was too hot for electrons to be 
bound into atoms. At sufficiently early times the rapid collisions of photons 
with free electrons would have kept radiation in thermal equilibrium with 
the hot dense matter. The number density of photons in equilibrium with 
matter at temperature T at photon frequency between v and v + dv is given 
by the black-body spectrum: 


8mv? dv 


exp (hv/kgT) — 1” ea") 


nr(v)dv = 


where A is the original Planck’s constant (which first made its appearance in 
a formula equivalent to this one), and kg is Boltzmann’s constant. (Recall 
that we are using units with c = 1.) 

As time passed, the matter became cooler and less dense, and eventually 
the radiation began a free expansion, but its spectrum has kept the same form. 
We can see this most easily under an extreme assumption, that there was 
a time tz when radiation suddenly went from being in thermal equilibrium 
with matter to a free expansion. (The subscript L stands for “last scat- 
tering.”) Under this assumption, a photon that has frequency v at some 
later time t when photons are traveling freely would have had frequency 
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va(t)/a(tz) at the time the radiation went out of equilibrium with matter, 
and so the number density at time ¢ of photons with frequency between v 
and v + dv would be 


3 
nv, D dv = (alt) aO) nra)(va/alr)) dea fan), 2.1.2) 


3 
with the factor (atr) /a(o) arising from the dilution of photons due to 


the cosmic expansion. Using Eq. (2.1.1) in (2.1.2), we see that the redshift 
factors a(t)/a(tz) all cancel except in the exponential, so that the number 
density at time ¢ is given by 

87v? dv 


nv, t)dv = ap Gu/keT@) = 1 =nrn(v)dv, (2.1.3) 


where 
T(t) = T(tr)a(tt)/a(t) . (2.1.4) 


Thus the photon density has been given by the black-body form even after 
the photons went out of equilibrium with matter, but with a redshifted 
temperature (2.1.4). 

This conclusion is obviously unchanged if the transition from 
opacity to transparency occupied a finite time interval, as long as the 
interactions of photons with matter during this interval are limited to 
elastic scattering processes in which photon frequencies are not changed. 
This is a very good approximation. We will see in Section 2.3 that the 
last interaction of photons with matter (until near the present) took place 
at a time when the cosmic temperature T was of order 3,000 K, when by 
far the most important interaction was the elastic scattering of photons 
with electrons, in which the fractional shift of photon frequency was of 
order kgT /mec” ~ 3 x 1077. In the following section we shall show that, 
because of the large photon entropy, even the small shift of photon fre- 
quency in elastic scattering and the relatively infrequent inelastic interac- 
tions of photons with hydrogen atoms had almost no effect on the photon 
spectrum. 

It was George Gamow and his collaborators who first recognized in the 
late 1940s that the universe should now be filled with black-body radiation.! 
The first plausible estimate of the present temperature of this radiation was 


1G. Gamow, Phys. Rev. 70, 572 (1946); R. A. Alpher, H. A. Bethe, and G. Gamow, Phys. Rev. 73, 
803 (1948); G. Gamow, Phys. Rev. 74, 505 (1948); R. A. Alpher and R. C. Herman, Nature 162, 774 
(1948); R. A. Alpher, R. C. Herman, and G. Gamow, Phys. Rev. 74, 1198 (1948); ibid 75, 332A (1949); 
ibid 75, 701 (1949); G. Gamow, Rev. Mod. Phys. 21, 367 (1949); R. A. Alpher, Phys. Rev. 74, 1577 
(1948); R. A. Alpher and R. C. Herman, Phys. Rev. 75, 1089 (1949). 
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made in 1950 by Ralph Alpher and Robert Herman.* On the basis of 
considerations of cosmological nucleosynthesis, to be discussed in Sec- 
tion 3.2, they found a present temperature of 5 K. This work was largely 
forgotten in subsequent decades, until in 1965 a group at Princeton started 
to search for a cosmic radiation background left over from the early uni- 
verse. They had only a rough idea of the temperature to be expected, based 
on a nucleosynthesis calculation of P. J. E. Peebles, which suggested a value 
of 10 K.* Before they could complete their experiment the radiation was 
discovered in a study of noise backgrounds in a radio telescope by Arno 
Penzias and Robert Wilson,* who published their work along with a com- 
panion article’ by the Princeton group explaining its possible cosmological 
significance.® 

Originally Penzias and Wilson could only report that the antenna 
temperature at a wavelength 7.5 cm was 3.5+1.0 K, meaning that the int- 
ensity of the radiation at this one wavelength agreed with Eq. (2.1.1) for 
this temperature. This of course did not show that they were observing 
black-body radiation. Then Roll and Wilkinson’ measured the radiation 
intensity at a wavelength of 3.2 cm, finding an antenna temperature of 
3.0 + 0.5 K, in agreement with what would be expected for black-body 
radiation at the temperature measured by Penzias and Wilson. In the fol- 
lowing few years a large number of measurements were made by other 
radio astronomers at other wavelengths. These measurements also gave 
antenna temperatures at the wavelengths being studied around 3 K, with 
uncertainties that gradually improved to of order 0.2 K. But this also did 
not establish the black-body nature of the radiation, because these mea- 
surements were all at wavelengths greater than about 0.3 cm, where the 
black-body energy distribution hvny (v) with T ~ 3 K has its maximum. 
For these long wavelengths the argument of the exponential is small, and 
Eq. (2.1.1) gives 


hvnr(v) x 8nv-kpT , (2.1.5) 


This is the Rayleigh—Jeans formula of classical statistical mechanics, but 
it describes the long-wavelength distribution of radiant energy under vari- 


2R. A. Alpher and R. C. Herman, Rev. Mod. Phys. 22, 153 (1950) 

3This work was never published. According to A. Guth, The Inflationary Universe (Perseus Books, 
Reading, MA, 1997), Peebles’ paper was rejected by The Physical Review, apparently because of the 
issue of the credit to be given to earlier work by R. Alpher, G. Gamow, and R. Herman. This earlier 
work and the subsequent work of Peebles and others is briefly described here in Section 3.2. 

4A. A. Penzias and R. W. Wilson, Astrophys. J. 142, 419 (1965). 

SR. H. Dicke, P. J. E. Peebles, P. G. Roll, and D. T. Wilkinson, Astrophys. J. 142, 414 (1965). 

6For a more detailed history of these developments, see A. Guth, op. cit., and S. Weinberg, The First 
Three Minutes (Basic Books, New York, 1977; second edition 1993). 

TP. G. Roll and D. T. Wilkinson, Phys. Rev. Lett. 16, 405 (1966). 
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ous circumstances more general than black-body radiation. For instance, 
black-body radiation diluted by an expansion that preserves the energy of 
individual photons also has hvn(v) œ v? for low frequencies.’ To con- 
firm that the cosmic microwave background radiation is really described 
by the black-body formula, it was necessary to see at least the beginning 
of the exponential fall-off of n7r(v) for wavelengths shorter than 
about 0.3 cm. 

This was difficult because the earth’s atmosphere becomes increasingly 
opaque at wavelengths shorter than about 0.3 cm. However, there had been 
a measurement of the radiation temperature at a wavelength 0.264 cm in 
1941, long before the discovery by Penzias and Wilson. In between the 
star ¢ Oph and the earth there is a cloud of cold molecular gas, whose 
absorption of light produces dark lines in the spectrum of the star. In 
1941 W. S. Adams,’ following a suggestion of Andrew McKellar, found 
two dark lines in the spectrum of ¢ Oph that could be identified as due 
to absorption of light by cyanogen (CN) in the molecular cloud. The first 
line, observed at a wavelength of 3,874.62 A, could be attributed to absorp- 
tion of light from the CN ground state, with rotational angular momentum 
J = 0, leading to the component of the first vibrationally excited state 
with J = 1. But the second line, at 3,874.00 A, represented absorption from 
the J = 1 rotationally excited vibrational ground state, leading to the J = 2 
component of the first vibrationally excited state.!° From this, McKellar 
concluded!! that a fraction of the CN molecules in the cloud were in the 
first excited rotational component of the vibrational ground state, which 
is above the true J = 0 ground state by an energy hc/(0.264 cm). and 
from this fraction he estimated an equivalent molecular temperature of 2.3 
K. Of course, he did not know that the CN molecules were being excited 
by radiation, much less by black-body radiation. After the discovery by 
Penzias and Wilson several astrophysicists!? independently noted that the 
old Adams—McKellar result could be explained by radiation with a 


8The sunlight falling on the earth’s surface provides a pretty good example of dilute black-body 
radiation; it is described by the Planck formula (2.1.1), with T ~ 6,000 K the temperature of the sun’s 
surface, but with the right-hand side of Eq. (2.1.1) multiplied by a factor (Ro IPÊ, where Ro is the 
radius of the sun and r is the distance from the sun to the earth. 

°W. S. Adams, Astrophys. J. 93, 11 (1941) 

10Today the wavelengths of these two lines are more accurately known to be 3,874.608 and 3,873.998 
Å. There is another line at 3875.763 Å, produced by transitions from the J = 1 rotationally excited 
vibrational ground state to the J = 0 component of the first vibrationally excited state. 

1A, McKellar, Publs. Dominion Astrophys. Observatory (Victoria, B.C.) 7, 251 (1941). 

12G. Field, G. H. Herbig, and J. L. Hitchcock, Astron. J. 71, 161 (1966); G. Field and J. L. Hitchcock, 
Phys. Rev. Lett. 16. 817 (1966); Astrophys. J. 146, 1 (1966); N. J. Woolf, quoted by P. Thaddeus 
and J. F. Clauser, Phys. Rev. Lett. 16, 819 (1966); I. S. Shklovsky, Astronomicheskii Tsircular No. 
364 (1966). 
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black-body temperature at wavelength 0.264 cm in the neighborhood of 
3 K. Theoretical analysis showed that nothing else could explain the exci- 
tation of this rotational state.!> This interpretation was then borne out by 
continuing observations on this and other absorption lines in CN as well as 
CH and CH? in the spectrum of ¢ Oph and other stars. '* 

The black-body spectrum of the cosmic microwave radiation background 
began to be established by balloon-borne and rocket-borne observations 
above the earth’s atmosphere at wavelengths below 0.3 cm. For some years 
there were indications of an excess over the black-body formula at these 
short wavelengths. It was clearly necessary to do these observations from 
space, but this is difficult; to measure the absolute value of the microwave 
radiation intensity it is necessary to compare the radiation received from 
space with that emitted by a “cold load” of liquid helium, which rapidly 
evaporates. Finally, the Planck spectrum of the microwave background was 
settled in the 1990s by observations with the FIRAS radiometer carried by 
the Cosmic Background Explorer Satellite (COBE), launched in November 
1989.'5 When a slide showing the agreement of the observed spectrum with 
the Planck black-body spectrum was shown by J. C. Mather at a meeting of 
the American Astronomical Society in January 1990, it received a standing 
ovation. It was found that the background radiation has a nearly exact 
black-body spectrum in the wavelength range from 0.5 cm to 0.05 cm.!6 
The comparison of observation with the black-body spectrum is shown in 
Figure 2.1. After six years of further analysis, the temperature was given 
as 2.725 + 0.002 K (95% confidence).!’ Other observations at wavelengths 
between 3 cm and 75 cm and at 0.03 cm are all consistent with a Planck 
distribution at this temperature. !® 

The energy density in this radiation is given by 


[0,0] 
Í hvn(v) dv = agT* (2.1.6) 
0 
where ag is the radiation energy constant; in c.g.s. units, 
82 °kp 
ag = ——~® = 7.56577(5) x 10715 erg em=3 deg~4 (2.1.7) 
15h3¢3 


13 Field et al., ref. 12; Thaddeus and Clauser, ref. 12. 

l4For a summary of this early work with references to the original literature, see G&C, Table 15.1. 

15J, C, Mather et al., Astrophys. J. 354, 237 (1990). 

16J, C, Mather et al., Astrophys. J. 420, 439 (1994). 

177, C. Mather, D. J. Fixsen, R. A. Shafer, C. Mosier, and D. T. Wilkinson, Astrophys. J. 512, 511 
(1999). A 1999 review by G. F. Smoot, in Proc. 3K Cosmology Conf., eds. A. Melchiorri et al. [astro- 
ph/9902027], gave a temperature 2.7377 + 0.0038 K (95% confidence), but the result of Mather et al. 
seems to be the one usually quoted. 

18 For a review, see G. Sironi et al., in Proc. Third Sakharov Conf. — Moscow 2002 [astro-ph/0301354]. 
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Figure 2.1: Comparison of the intensity of radiation observed with the FIRAS radiometer 
carried by COBE with a black-body spectrum with temperature 2.728 K, from D. J. Fixsen 
et al., Astrophys. J. 473, 576 (1996) [astro-ph/9605054]. The vertical axis gives the 
intensity in kiloJansky per steradian (one Jansky equals 10723 erg cm~? s7! Hz); 
the horizontal axis gives the reciprocal wavelength in cm~!. The lo experimental 
uncertainty in intensity is indicated by the tiny vertical bars; the uncertainty in wavelength is 
negligible. 


Using T = 2.725 K, this gives an equivalent mass density (reverting to 
c=1) 

Py0 = apT yo = 4.64 x 10-76 cm~’. 
Taking the ratio of this with the critical density (1.5.28) gives 


Q, = 2 = 247 x 1075h? (2.1.8) 


POcrit 


We will see in Section 3.1 that the photons are accompanied with neutrinos 
and antineutrinos of three different types, giving a total energy density in 
radiation (that is, in massless or nearly massless particles): 


7 4 ve 34 3 
PRO = 1+3(2) (=) py = 7.80 x 1074gem—3, (2.1.9) 


or in other words, using Eq. (1.5.28), 


PRO 
P0,crit 


QR = = 4.15 x 10747? . (2.1.10) 
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We see that pro is much less than the critical mass density needed to give 
K = 0, and much less even than the mass density of ordinary matter seen in 
stars. It is for this reason that we have generally neglected Qp in calculating 
luminosity distances as a function of redshift. 

On the other hand, even at present the number density of photons is 
relatively very large. Eq. (2.1.1) gives 


Pree iE 82 v2 dv _ 30¢(3) apT? 
pom o exp(hv/kgT)—-1 24 kg 
agT? 3-3 
= 0.3702 = 20.28 [T (deg K)]? cm” , (2.1.11) 


kg 


where ¢(3) = 1.202057... For T = 2.725 K this gives a present number 
density 


n,o = 410 photons/cm? . (2.1.12) 


This is much larger than the present number density ngo of nucleons, given 
by 


2 
ngo = ee = 1.123 x 1075 2z h? nucleons/cem? . (2.1.13) 


Both n, and ng vary with time as a~>(t), so the ratio ny /ng has been the 
same at least during the whole period that photons have been traveling freely. 


x OK k 


There is an effect of the cosmic microwave background that has long been 
expected but has been difficult to observe. A cosmic ray proton of moderate 
energy striking a photon in the cosmic microwave background can only 
scatter the photon, a process whose rate is proportional to the square of the 
fine structure constant a ~ 1/137. However, if the proton has sufficiently 
high energy then it is also possible for the photon to be converted into a 
x meson in the reactions y + p > x? +p or y +p —> a+ +n, processes 
whose rate is proportional to a, not a*. Assuming that high energy cosmic 
rays come to us from outside our galaxy, we therefore expect a dip in the 
spectrum of cosmic ray protons at an energy where the cross section for these 
processes becomes appreciable.!? Although some pions can be produced at 
lower energy, the effective threshold is at a value of the total energy W of the 


19K, Greisen, Phys. Rev. Lett. 16, 748 (1966); G. T. Zatsepin and V. A. Kuzmin, Pis'ma Sh. Exsp. 
Teor. Fiz. 4, 114 (1966) [transl. Sov. Phys. JETP Lett. 4, 78 (1966)]; F. W. Stecker, Phys. Rev. Lett. 21, 
1016 (1968). 
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initial proton and photon in the center-of-mass system equal to ma = 1232 
MeV, the mass of the pion—nucleon resonance with spin-parity 3/2 and 
isospin 3/2. The center of mass energy is 


W = G [p? +m? j — are) 
~ (2ar(1 —cos6) +m) l (2.1.14) 


where q and pare the initial photon and proton momenta (with p > mp > q) 
and @ is the angle between them. The threshold condition that W > ma 
thus requires that 

qp(l — cos 0) > mx — m3 . (2.1.15) 
The typical energy of photons in black-body radiation at temperature Tyo = 
2.725K is pyo/Nyo = 6 x 1074 eV, while the largest value for 1 — cos 8 is 2, 
so the effective threshold is roughly at a proton energy 


2 2 
mM — m 

Pthreshola ¥ —-——" > 10” eV. (2.1.16) 
2py0/ny0 


This effect is not easy to see. The flux of cosmic ray protons with energies 
between E and E + dE goes roughly as E7? dE, so there are few protons at 
these very high energies, roughly one per square kilometer per year above 
10!? eV and 0.01 per square kilometer per year above 10° eV. At these 
rates, direct observation is clearly impossible, and the cosmic rays have had 
to be studied indirectly by observation at ground level of the large showers 
of photons and charged particles that they produce. Also, there is a smooth 
distribution of photon energies and directions, so one is not looking for a 
sharp cut-off at 10% eV, but rather for a dip below the E7? curve at around 
this energy.” No such effect was observed by the Akeno Giant Air Shower 
Array,”! but a subsequent analysis of this and other observations showed 
the effect.2* More recently signs of a drop appeared in the High Resolution 
Fly’s Eye experiment.?? In 2006 this group announced the observation of 
a “sharp suppression” of the primary cosmic ray spectrum at an energy of 
6 x 10!° GeV, just about where expected.” 


20For instance, see I. F. M. Albuquerque and G. F. Smoot, Astroparticle Phys. 25, 375 (2006) 
[astro-ph/0504088}]. 

21M. Takeda et al. Phys. Rev. Lett. 81, 1163 (1998). 

22), N. Bahcall and E. Waxman, Phys. Lett. B556, 1 (2003) [hep-ph/0206217]. 

23R. U. Abbasi et al., Phys. Rev. Lett. 92, 151101 (2004) [astro-ph/0208243]; Phys. Lett. B619, 271 
(2005) [astro-ph/0501317]. 

24G, B. Thompson, for HiRes Collaboration, in Proc. Quarks ‘06 Conf. [astro-ph/0609403]; R. U. 
Abbasi et al., astro-ph/0703099. 
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As already remarked in the previous section, if we look back in time suffici- 
ently far, we surely must come to an era when the temperature and den- 
sity were sufficiently high so that radiation and matter were in thermal 
equilibrium. We will now consider this era, jumping over the intermediate 
time when radiation was going out of equilibrium with matter, which will 
be the subject of the next section. 

As we saw in the previous section, in a free expansion of photons, the 
frequency distribution preserves the Planck black-body form (2.1.1), but 
with a temperature that falls as 1/a(t). On the other hand, in a free expan- 
sion of non-relativistic particles such as electrons or nuclei, the momentum 
distribution preserves the Maxwell—Boltzmann form, n(p)dp œ 
exp(—p”/2mkgT), but since (as shown in Eq. (1.1.23)) the momentum of 
any particle decreases as the universe expands with p œ 1/a, the temper- 
ature of the Maxwell—Boltzmann distribution decreases as 1/a’(t). So if 
radiation is in equilibrium with matter, who wins? Does the temperature 
decrease as 1/a(t), or 1/a° (t), or something more complicated? 

The issue is settled democratically, on the basis of one particle, one vote. 
Since there are so many more photons than electrons or nucleons, the pho- 
tons win, and the temperature decreases almost exactly as 1/a(t). 

We can see this in more detail by applying the second law of therm- 
odynamics. In equilibrium both the entropy and the baryon number (that 
is, at temperatures < 10!3 K, the number of protons and neutrons) in any 
co-moving volume were constant, and so their ratio, the entropy per baryon, 
was also constant. It is convenient to write the entropy per baryon as kgo, 
with kg the Boltzmann constant and o dimensionless. The second law of 
thermodynamics tells us that this satisfies 


d(kso) _ de/ng) + pd(l/nB) 221 


T ; 
where ng is the baryon number density (so that 1/ng is the volume per 
baryon), € is the thermal energy density and p is the pressure. For simplicity, 
let us consider an ideal gas of photons and non-relativistic particles (mostly 
protons, helium nuclei, and electrons), with a fixed number (of order unity) 
N of the non-relativistic particles per baryon. Then 


3 1 
€ = apT* + 5ng N ksT , p= 34%8T* +ng NksT , (2.2.2) 
with ag the radiation energy constant (2.1.7). The solution of Eq. (2.2.1) is 
here 
4agT? Por 
= N In | —~], 2.2.3 
ý 3ngkg $ Š (= ( ) 
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where C is an arbitrary constant of integration. It is the quantity o that 
remains constant in thermal equilibrium. 

We saw in the previous section that the first term in Eq. (2.2.3) is larger 
than 108 at present. (Compare Eqs. (2.1.12) and (2.1.13).) Let us tentatively 
assume that this quantity was also much larger than unity when photons 
were in equilibrium with matter. Since o was constant in this era, this 
means that the ratio T?/ng would also have been very close to constant 
during the era of equilibrium, unless the quantity 7*/*/ng changed by an 
enormous amount. For instance, if at some time in the equilibrium era the 
first term in Eq. (2.2.3) were of order 108, then in order for the first term in 
Eq. (2.2.3) to have changed by even 0.01%, to keep o constant the value of 
T?/2 /ng would have had to change by a factor e!0000/N, We are not going 
to be considering such enormous density or temperature ratios here, so we 
can conclude that T?/ng was essentially constant in thermal equilibrium. 
We saw in the previous section that this ratio was also constant when the 
photons were traveling freely, and also when photons were interacting only 
by purely elastic Thomson scattering, so it has been close to constant from 
the beginning of the era considered here to the present. If we define the 
constant C in Eq. (2.2.3) to equal the value of T3/*/ng at some typical time 
during the era of equilibrium, then the entropy per baryon throughout this 
era can be taken as 


dagT? —3.60nyo 
oz — 
3ngkp NBO 


= 1.31 x 10847? Q3! . (2.2.4) 


The conservation of baryon number tells us that nga? is constant, so the 
constancy of (2.2.3) has the consequence that T œ 1/a. (Note incidentally 
that if the first term in Eq. (2.2.3) were of order 1078 instead of greater than 
108, then it would be the logarithm in the second term that would have to 
remain constant during thermal equilibrium, in which case T « ne 3 & ae 
as expected if non-relativistic particles dominate the thermal evolution.) 
We can now see why the black-body spectrum with T œ 1/a is 
preserved as photons go out of equilibrium with matter, even when we take 
into account small inelastic effects, like the loss of energy to electron recoil 
in photon-electron scattering. Photons effectively stop gaining or losing 
energy to matter when Ty < H, where H = å/a and T, is the rate at which 
an individual photon loses or gains an energy kgT through scattering on 
electrons (to be calculated later). But the rate Fe at which an individual elec- 
tron would gain or lose an energy kgT by scattering on photons is greater 
than T, by a factor ny /ne > 108, so at the time that T, drops below H we 
still have re >> H. Thus instead of the electron kinetic energies decreasing 
like 1/a?, as they would in a free expansion, they continue to remain in 
thermal equilibrium with the photons. While in equilibrium with matter 


110 


2.2 The equilibrium era 


the photon temperature goes as 1/a, and we saw in the previous section that 
when they stop interacting with matter the temperature continues to drop 
like 1/a, so through this whole period the electron temperature also drops as 
1/a, and the last few exchanges of energy that photons have with electrons 
do not affect the photon energy distribution. 

According to results quoted in the previous section, the present ratio of 
the equivalent photon and neutrino mass density and the total matter mass 
density omo = Qu Pogcrit is 

PR. L 


R a Poe Ee) 
= —=415x10°Q,,h~. 2.2.5) 
PMO QM a l 


As we have seen, the photon and neutrino energy density varied as T4 « a~4 


even before the photons began their free expansion, while the density of 
pressureless matter varied as a~, so pr/pm varied as 1/a x T. Therefore 
the energy density of photons and neutrinos was equal to that of matter 
when the temperature was 


Ty 
yoM = 6.56 x 104K x Qyh?, (2.2.6) 


TEQ = 


and pr was greater than pm at earlier times. For Qyh? ~ 0.15, T; EQ İS 
about 104 K. 

Although collisions cannot change the distribution of photon energies 
as long as the photon number is much greater than the number of charged 
particles, at sufficiently high temperatures collisions can drastically change 
the energy of an individual photon. It is of some interest to work out when 
photons stopped exchanging energies of order kg T with electrons. The rate 
at which any individual photon is scattered by electrons is Ay = o7nec, 
where ne is the number density of electrons, and oy = 0.66525 x 10-24 
cm? is the cross section for Thomson scattering, the elastic scattering of 
photons by non-relativistic electrons. As we will see in Section 3.2, about 
76% of the matter of the universe in this era was ionized hydrogen, with the 
rest helium, completely ionized at temperatures above 20,000 K, so with 
one electron per nucleon for hydrogen and half an electron per nucleon for 
helium, the net number of electrons per nucleon is 0.76 + (1/2)0.24 = 0.88, 
and the number density of electrons at temperature T is ne ~ 0.88ng = 
0.88ng0(T/Tyo)°, with a subscript zero as usual indicating the present 
moment. Using Eq. (2.1.13), the rate at which a photon is scattered by 
electrons is 


ry PN 
Ay = 0.88ng0(——) orc =1.97 x 107 s7! x 2g (——) . 2.2.7 
Ty 0 Ty 0 
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But this is not the rate that governs the effectiveness of the energy exchange 
between matter and radiation. A photon with energy much less than mec? 
that strikes a non-relativistic electron will transfer a momentum to the elec- 
tron of the order of its own momentum, typically about kgT (in units with 
c = 1), and so it will gain or lose an energy of order (kgT)*/me. (This 
agrees with Eq. (C.20) for œ ~ kgT « me.) Hence the rate for energy 
transfer of order kgT between a given photon and the electrons equals the 
rate of collisions times the fraction of the energy kgT that is transferred per 
collision: 


T rN? 
es ke Ay © 9.0 x 1072s! Qgh? (——) . (2.2.8) 
Me Tyo 


We have to compare this with the cosmic expansion rate. Let’s tentatively 
assume that at the time that concerns us here the density of the universe was 
dominated by photons and neutrinos, not matter, in which case the cosmic 
expansion rate was 


a T N? 
H = Ż = H, /QRT4/T4 =2.1 x102 — |. 2.2, 
a = Hoy 2RT"/Tyo x 10 f s To (2.2.9) 


Therefore r, was greater than H until the temperature dropped to a value 
of order 


-1/2 
Tireere = 1.5 x 104K (22h?) l (2.2.10) 


For Qgh? ~ 0.02, this is about 10° K. (Comparison with Eq. (2.2.6) shows 
that for plausible values of Q ul? and Qgh?, as for instance Qyh? ~ 0.15 
and gh? ~ 0.02, we have Threeze > T EQ, so the temperature (2.2.10) was 
reached while the expansion rate was still dominated by radiation, as we 
have been assuming.) 

After the temperature dropped below about 10° K photons no longer 
exchanged appreciable energy with electrons, but at this temperature the 
rate (2.2.7) of elastic scattering was still roughly 10° times greater than H. 
If Eq. (2.2.7) remained valid then A, would remain larger than H until the 
temperature dropped to much lower temperature. For 3K « T « 10*K 
the universe was matter-dominated, with 


T \3?2 
H = Hy,/QuT3/T3, = 3.3 x 10-8 s! Vu (—) l 
yo 


This became equal to the rate given by Eq. (2.2.7) at a temperature T ~ 
18 K (Qyh?)'!3/(Qgh’)7/?, or about 130 K for Qyh? = 0.15 and Qgh? = 
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0.02, and until then each photon would be scattered many times by electrons 
in each doubling of a(t). This is not what actually happens, because when 
the temperature became low enough for electrons and nuclei to hold together 
as neutral atoms, the elastic scattering rate A, dropped sharply below the 
value (2.2.7). As we shall see in the next section, this was at a temperature 
of about 3,000 K, which marked the end of the era of rapid scattering of 
photons by electrons. 


2.3 Recombination and last scattering 


We saw in the previous section that photons stopped exchanging energy 
effectively with electrons when the temperature of the expanding universe 
dropped to about 10° K. After that, photons continued to be scattered 
by free electrons, but without appreciable gain or loss of energy. This 
terminated when the free electrons became bound into hydrogen and helium 
atoms, ending the scattering of photons. This is called recombination.' Let’s 
consider when this happened. 

We start our calculation at a time early enough so that protons, elec- 
trons, and hydrogen and helium atoms were in thermal equilibrium at the 
temperature of the radiation. In a gas in equilibrium at temperature T, the 
number density of any non-relativistic non-degenerate particle of type i is 
given by the Maxwell—Boltzmann formula: 


2 
ni = gi Qrp Reiter | ag exp |- (r + 2) fr] , (2.3.1) 
2m; 


where m; is the particle mass, g; is the number of its spin states, and u; is a 
characteristic of the gas known as the chemical potential of particles of type i. 
(The generalization of Eq. (2.3.1) to include the effects of relativity and/or 
degeneracy is given in the following chapter.) The property of the chemical 
potentials that make this a useful formula is that they are conserved in any 
reaction that is occurring rapidly in the gas. In our case, the particles are 
protons, electrons, and hydrogen atoms in any bound state, for which we 
take i as p, e, 1s, 2s, 2p, etc. (As already mentioned, about 24% of the mass 
of the early universe was in the form of helium nuclei, but helium atoms are 
more tightly bound than hydrogen atoms, so that at the time that concerns us 
now, say for T < 4,400 K, almost all the helium was locked up in the form of 
neutral atoms, and therefore played no role here.) The electron and proton 


' The “re” in “recombination” may be misleading; before this time electrons and protons had never 
been combined into atoms. This has become the standard term throughout astrophysics for the capture 
of electrons into atoms. 


113 


2 The Cosmic Microwave Radiation Background 


have spin one-half, so gp = ge = 2, while the 1s ground state of the hydrogen 
atom has two hyperfine states with spins 0 and 1, so gj; = 1+3 = 4. At first 
the recombination and ionization reactions p + e = Hj; occurred rapidly 
by cascades of radiative transitions through excited states, so the chemical 
potentials satisfied 

Up + Me = His - (2.3.2) 
(Photons can be freely created and destroyed in reactions like e + p = 
e +p + y, so their chemical potential vanishes.) The integrals are 


2 ket \3? 
anny fa P = (7) 
e f oo 2mkgT 2x? 


so the conservation law (2.3.2) gives 


Nig [SS 
NpNe Qnh- 


3/2 
) exp(Bi /kpT) , (2.3.3) 


where Bı = mp +me—my = 13.6 eV is the binding energy of the 1s ground 
state of hydrogen. (Here we ignore the difference between the hydrogen 
mass and the proton mass, except in the exponential.) Also, the charge 
neutrality of cosmic matter requires that 


Ne = Ny. (2.3.4) 


Further, in equilibrium the number density of hydrogen atoms in any one 
excited state is less than the number density in the ground states by a factor 
of order exp(—AE/kgT), where AE is the excitation energy, which is nec- 
essarily not less than the difference 10.6 eV in the binding energies of the 
n= l andn = 2 states of hydrogen. (Excited states had the same chemical 
potential as the ground state, since in equilibrium the atom could go rapidly 
from one to the other by emitting or absorbing photons.) For temperatures 
below 4,200 K this exponential factor is less than 6 x 107!3, so that to a 
good approximation we can neglect the presence of excited hydrogen atoms 
as long as thermal equilibrium persisted. As we will see in Section 3.2, the 
matter at the time of recombination was about 76% by weight neutral or 
ionized hydrogen, so we can take 


np + nis = 0.76ng , (2.3.5) 


where npg is the number density of baryons (i.e. at the temperatures of int- 
erest here, of neutrons and protons.) The fractional hydrogen ionization 
X = np/(np + nis) therefore satisfies the Saha equation: 


X(1+SYX) = 1 (2.3.6) 
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keT \ 3? 
s= Ce tri OTe n ("<8 ) exp(Bi/keT). (2.3.7) 


p 


One might think that the fractional ionization in equilibrium would 
become small when the temperature drops below the value By /kg = 157,894 
K, but even in equilibrium the recombination is considerably delayed, 
because of the small value of the coefficient of the exponential in Eq. (2.3.7). 
With ng = ng0(T/Tyo)* and ngo given by Eq. (2.1.13), we have 


S = 1.747 x 1077? e!7894/T 73/2 Ogh, (2.3.8) 


where A is here again the Hubble constant in units of 100 km s~! Mpc™!, 


and T is the temperature in degrees Kelvin. This function of tempera- 
ture is extremely rapidly varying where it is of order unity, so the Saha 
equation gives a quite sharp temperature of recombination, as shown in 
Table 2.1. We see from Table 2.1 that the equilibrium value of the ion- 
ization dropped from over 97% for T = 4,200 K to less than 1% for 
T = 3,000 K. 

This gives the correct order of magnitude of the temperature of the 
steep decline in fractional ionization, but it is not correct in detail, because 


Table 2.1: Equilibrium hydrogen ionization X for various values of the temperature T and 
Q phe 


T (K) Qzh? = 0.01 Qh? = 0.02 Qh? = 0.03 
4,500 0.999 0.998 0.997 
4,200 0.990 0.981 0.971 
4,000 0.945 0.900 0.863 
3,800 0.747 0.634 0.565 
3,600 0.383 0.290 0.244 
3,400 0.131 0.094 0.078 
3,200 0.0337 0.0240 0.0196 
3,000 0.00693 0.00491 0.00401 
2,800 0.00112 0.00079 0.00065 
2.725 2.8 x 10712571 2.0 x 10712571 1.6 x 107157! 
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equilibrium was not actually maintained for low ionization levels. The pho- 
ton that is emitted when a free electron is captured by a proton into the 
ground state has more than enough energy to ionize another hydrogen atom, 
so this process produces no net decrease in ionization. Similarly, the photon 
emitted when an electron in a high orbit of the hydrogen atom with principal 
quantum number n > 3 falls into the ground state has more than enough 
energy to lift an electron in the ground state of some other hydrogen atom 
to the n = 2 excited state, so this process also produces no net increase in 
the number of electrons in the ground state. The ground state of hydro- 
gen is typically reached by formation of excited states H* in the reaction 
e+p — H* +y, followed by a cascade of radiative decays down to then = 2 
excited state. The final transition from the 2p excited state to the 1s ground 
state by emission of a single Lyman « photon is impeded by the same effect 
that impedes the transition of free electrons or electrons in higher excited 
states to the ground state: the Lyman a photon does not simply merge into 
the thermal radiation background; it has a large resonant cross section for 
exciting another hydrogen atom from the ground state to the first excited 
state, from which it is most often reionized (as shown by the small equil- 
ibrium numbers of hydrogen atoms in excited states). But in cosmology this 
process is not entirely ineffective, because the Lyman a photon emitted by 
one atom will have just barely enough energy to excite another hydrogen 
atom in its ground state to the 2p state, so if it does not interact very soon 
with another atom then the cosmological redshift will take its energy out- 
side the resonant line, after which it no longer has enough energy to excite 
a hydrogen atom in its ground state. Even so, the formation of the ground 
state by radiative decay from the 2p state is so inefficient that we also have 
to consider slower pathways to the ground state, such as the formation of 
the 2s excited state, which can only decay into the ground state by emitting 
two photons, neither of which has enough energy to re-excite a hydrogen 
atom in its ground state. When the ionization became small the rate of these 
reactions could no longer compete with the cosmic expansion rate, and the 
ionization no longer fell as fast as it would in thermal equilibrium.” 


2The classic paper on this subject is by P. J. E. Peebles, Astrophys. J. 153, 1 (1968). Also see Ya. B. 
Zel'dovich, V. G. Kurt, and R. A. Sunyaev, Soviet Physics JETP 28, 146 (1969). At the time of writing the 
most thorough analysis known to me is that of S. Seager, D. D. Sasselov, and D. Scott , Astrophys. J. 523, 
L1 (1999); Astrophys. J. Suppl. Ser. 128, 407 (2000) [astro-ph/9909275]. Corrections of the order of a 
percent of the calculated value due to additional transitions and stimulated emission were subsequently 
found by V. K. Dubrovich and S. I. Grachev, Astron. Lett. 31, 359 (2005) [astro-ph/051672]; J. Chluba 
and R. A. Sunyaev, Astron. Astrophys. 446, 39 (2006) [astro-ph/0508144]; W. Y. Wong and D. Scott, 
Mon. Not. Roy. Astron. Soc. 375, 1441 (2007) [astro-ph/0610691]. The possible use of observations 
of the cosmic microwave background to resolve these uncertainties is discussed by A. Lewis, J. Weller, 
and R. Battye, Mon. Not. Roy. Astron. Soc. 373, 561 (2006) [astro-ph/0606552]. For the sake of 
simplicity here I will make the same approximations as Peebles, but will use the “escape probability” 
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This is a complicated business, but it is not hard to see the main outlines 
of the recombination process. We make the following approximations: 


1. Collisions between hydrogen atoms and radiative transitions between the 
states of these atoms are sufficiently rapid so that all states of the atoms are in 
equilibrium with each other at the temperature T of the radiation, except for 
the 1s ground state, which as already mentioned is reached only through slow 
or inefficient processes. This has the consequence that the number density 
of states n £ of hydrogen having principal quantum number n (with n > 1) 
and orbital angular momentum £ can be expressed in terms of the number 
density of any one state, say the 2s state:? 


nne = (26 + 1) ms exp ((B = By) /keT) (2.3.9) 


where B, is the binding energy* of the state with principal quantum 
number n. 


2. The net rate of change in the population of hydrogen atoms in their 1s state 
is given by the rate of radiative decay from the 2s and 2p states, minus the 
rate of excitation of these states from the 1s state. In accordance with the dis- 
cussion above, all other processes leading to the ground state are assumed to 
be canceled by the reionization or re-excitation of other atoms by the emitted 
photon. Because recombination occurs in collisions of electrons and pro- 
tons, it decreases the number nea? of free electrons in a co-moving volume a? 
at arate (T )npnea? that is proportional both to np = ne and nea, with the 
coefficient a(T) depending only on temperature, not on ne or a. (We do not 
include recombination directly to the ground state here, since it is canceled 
by the ionization of other atoms by the emitted photon, so a(f) is what in 
astrophysics is called the “case B recombination coefficient.”) Also, leaving 
aside ionization from the ground state which just cancels recombination to 
the ground state, the ionization from excited states of hydrogen increases 
nea? at a rate given by a sum of terms proportional to the npea? with n > 1, 
with coefficients depending only on temperature. Eq. (2.3.9) gives all the 
nne With n > 1 as proportional to mzs, with coefficients that also depend 
only on temperature, so ionization increases the number of electrons in a 
co-moving volume a? at a rate that can be written as B(T)n2,a3, with B(T) 


method discussed by Seager et al., which seems to me more direct and easier to justify. 

3This formula does not apply to hydrogen atoms in very high excited states, which have such large 
radii and small binding energies that they cannot be treated as free particles. See, e.g., D. Mihalas, 
Stellar Atmospheres, 2nd edition (Freeman, San Francisco, 1978). 

4We are here neglecting the fine structure, hyperfine structure, and Lamb energy shifts, which 
give the binding energies a very small dependence on £ and on the total (including spin) angular 
momentum j. 
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a function only of temperature, not of a or ne or any of the npe. Putting 
these rates together, we have 


d 
S (nea) = —ana> + B m,a? . 
Dividing by the constant na? (where n = np +n = np +}, nne = 0.76Np), 
this gives 


d @) z oan P Bnas l 
dt \n n n 

Further, this must vanish under conditions of equilibrium, where (since the 
transitions e + p = 2s occur rapidly) instead of Eq. (2.3.3) we would have 


Ms E 
ne NO Oh? 


(2.3.10) 


-3/2 
) exp(B2/kgT) , 


so the coefficients in Eq. (2.3.10) are related by 


2 keT \ 3/2 
pas [a 7 (= 5 ) exp(—B2/kgT). (2.3.11) 
N2s Bele os 20h 
equilibrium 
In order for Eq. (2.3.10) to be useful, we need to relate 12, to the total 
number density n of protons and hydrogen atoms. 


3. The total number of excited hydrogen atoms in a co-moving volume 1/n 
changes so much more slowly than individual radiative processes that the net 
increase in this number due to recombination and reionization of hydrogen is 
balanced by the net decrease in this number by transitions to and from the 1s 
state. That is, 


a ne — Bros = (T25 + 3 PV 2p)N25 — Enis (2.3.12) 


where Tzs and "7, are the rates for the radiative decay processes 2s —> 
Is +y +y and 2p > 1s + y, respectively;> P is the probability that the 
Lyman « photon emitted in the decay 2p > 1s + y will escape to infinity 
without exciting some other hydrogen atom in the 1s state back to the 2p 
state (to be calculated below); and € is the rate at which hydrogen atoms 
in the 1s state are excited to the 2s or 2p state, not including those that are 
excited to the 2p state by Lyman « photons from the decays 2p > 1s + y, 
which we take into account with the factor P. (The factor 3 in Eq. (2.3.12) 


5We neglect the process 2p > 1s + y + y, which is much slower than 2s > Is+y +y. 
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enters because Eq. (2.3.9) gives 2) = 372s.) From this, we obtain the needed 
formula for no,: 


o ga n2 +EnNns 
~ Ta + 3PP xy + B 


ns (2.3.13) 


We will be concerned here with temperatures T « (B2—B3)/kg = 21,900 K, 
so according to Eq. (2.3.9) all nne with n > 2 are much less than nz, 
and hence to a very good approximation the total number of hydrogen 
atoms is 


NH = Nis + N25 + Ny = Nis + 4na . (2.3.14) 
We can therefore eliminate 71, in favor of ny in Eq. (2.3.13), so 


_ an? +Eny 
~ Tay + 3PM 2p + B +4E 


Nos (2.3.15) 


Further, in equilibrium the number of hydrogen atoms in the 1s state would 
be constant, so the coefficients on the right-hand side of Eq. (2.3.12) (which 
gives the net rate of increase of the number density of hydrogen atoms in 
the ground state) must have the ratio 


E r2) 
~ = exp ( — (Bı — B2)/kgT) . (2.3.16 
Pas + 3PT2 - equilibrium p( UE ) ( ) 


Using Eqs. (2.3.15) and (2.3.16) in Eq. (2.3.10) now gives the rate 
equation in a more useful form 


d C = P25 + 3PT2p 
dt\n/ (Tas + 3PT 2p) [1 + 4exp ( - (Bı —- B2)/ksT )| +B 


2 
ang 


n 


z (- [1 + 4exp ( ee By) /keT) | 


+exp ( OR Bp) /ksT) eu) 


At the temperatures of interest the factor | + 4exp(—(B, — Bo)/kgT) can 
be replaced with unity. Using Eq. (2.3.11) and the definition of fractional 
ionization XY = ne/n = np/n = 1 — ny /n, we have at last 


dX _ ( Tas + 3PT 2p 


= REPS E= K Na 2.3.17 
dt eg) an +S- X) eon 
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where S(T) is the function (2.3.7) appearing in the Saha equation. Note 
that if the temperature were constant then this would be satisfied by any 
solution of the Saha equation (2.3.6). In fact, X is always larger than the 
value given by the Saha equation, so Eq. (2.3.17) gives a monotonically 
decreasing fractional ionization. The first factor in Eq. (2.3.17) represents 
the suppression of recombination that occurs when the transitions of the 
2s and 2p states to the ground state are slower than the reionization of the 
atom. 

It remains to calculate the photon survival probability P. This is given 
in general by 


+oo Cc 
P(t)= J dw P(w) exp |- J a! ms) co (waat), (2.3.18) 
—CO t 


where P(w) dæ is the probability that a photon emitted in the transition 
2p — 1s has energy between fw and h(w + dw), normalized so that f P(@) 
dw = 1, and o (w) is the cross section for the excitation 1s —> 2p by a photon 
of energy fw. The factor a(t)/a(t’) arises from the cosmological redshift of 
the photon. The cross section is given by the Breit-Wigner formula® 


2 
o(w) = (5) (=) Plo) , (2.3.19) 


where kg is the mean wave number (Bı — B2)/fic of the Lyman «œ photon 
emitted in the transition 2p — 1s. The probability density here is 


Pop 1 
2m (w — wa)? +13,/4 


P(w) = (2.3.20) 


where @, is the circular frequency cka corresponding to the wave number 
ka. 

Now, if a photon is captured at all then it is captured in a time much 
less than the characteristic expansion time of the universe, so we can set the 
density n1s(t) in Eq. (2.3.18) equal to nıs(t). The cross section o (w) varies 
very rapidly with w, so we cannot neglect the time-dependence of a(t), but 
we can take the expansion rate to be constant in the integral over 7’, so that 
a(t)/a(’) = 1— HA — t), where H(t) = a(t)/a(t). It is convenient then 


®See, e.g., S. Weinberg, The Quantum Theory of Fields, (Cambridge University Press, Cambridge, 
UK, 1995, 1996, 2000) [referred to henceforth as QTF]: Vol. I, Eq. (3.8.16). The factor 3/2 arises from 
the number 2¢ + 1 = 3 of 2p angular momentum states, and the number 2 of photon helicity states. If 
the spin of the electron and proton are taken into account then the numerator is 12 and the denominator 
is 8, but the ratio is the same. 
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to change the variable of integration in the exponential in Eq. (2.3.18) from 
t tow’ = (1 — H(t)’ — 1))a, so that Eq. (2.3.18) reads 


+00 2 w 
P(t) = J dw P(w) exp | pee J do! P| , (2.3.21) 


The function P (œ) is negligible except for œ very near wg, so we can replace 
œw With œx in the factor 1/w in the argument of the exponential (but not in 
the upper limit of the integral over w’, which rises steeply from zero to one 
for œ in the neighborhood of wy.) The integral over w is now trivial, and 
gives 


(2.3.22) 


2 
Pi) =F (rene) | 


wa H (t) k2 
where 
F(x) = (1 —e™™)/x. (2.3.23) 


The Lyman a transition rate’ is ">, = 4.699 x 108 s71, so that the argument 
of the function F(x) in Eq. (2.3.22) is very large. For instance, if we take 
Qyh? = 0.15 and Rgh? = 0.01, then for T < 6,000 K the argument x is 
greater than 10. Hence for the temperatures of interest here we can drop 
the exponential in Eq. (2.3.23), and find 


_ oHk  8xH 
T 3x? Tapis C = 3A3T 2p M15 í 


(2.3.24) 


where Ay = 1215.682 x 1078 cm is the wavelength of Lyman a photons, and 
the argument ¢ is now understood. We see that the quantity 3PT2p in the 
rate equation (2.3.17) is independent of I): 


82 H 


S 2.3.25 
23 Nis ( ) 


3PT2p = 
With this result for P, equation (2.3.17) is the same as the rate equation 
found in a different way by Peebles.? In this equation we may use the appr- 
oximation that at the temperatures of interest, which are much less than 
(Bı — B2)/kg = 118,420 K, even though 71,/n2; is less than it would be in 
thermal equilibrium it is still much larger than unity, so that we can replace 
nis with ny = (1 — X)n. We also replace time with temperature as the 
independent variable, using 


d _ 1 


SS Sa 2.3.2 
dT HT ee?) 


TW. L. Wiese, M. W. Smith, and R. M. Glennon, Atomic Transition Probabilities, Vol. I, National 
Standard Reference Data Series NBS 4 (1966). 
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The rate equation (2.3.17) then becomes 


dX œn 14 B 
dT HT Fos + 82 H/A3n(1 


=. [x?-a-x)/s| . (2.3.27) 


Now we must put in some numbers. In calculating the expansion rate we 
need to include both the energy density of non-relativistic matter and that of 
neutrinos and photons, but at temperatures where neutrinos are important 
their mass is negligible, and for temperatures T > 30 K we may neglect 
vacuum energy, so 


1/2 

T \’ rN” 
H = Ho | Qm | =— | +2r | = 
Tyo Ty 0 


= 7.204 x 10719 73/2. /Qyh? + 1.523 x 10-57 s7! , (2.3.28) 


where again in all numerical expressions T is the temperature in degrees 
Kelvin. The number density of ionized and un-ionized hydrogen is 


3Hi QB 


= 0.76 
82 GMp 


T 3 
(—) = 4.218 x 10-7 Qgh? T? cm? . (2.3.29) 
Tyo 


The two-photon decay rate of the 2s state isë 
Poy = 8.22458 s7! . (2.3.30) 


The coefficient of the proton number density in the electron recombination 
rate used by Peebles is? 


a = 2.84 x 10° T! cm? s7! . 


The factor T~!/2 here is what would be expected from a factor velocity~! 


that generally appears in the cross sections for exothermic reactions like e + 
p— H+y.!° Actually a represents a chain of reactions more complicated 
than just the radiative capture of an electron, including the cascade of radia- 
tive decays down to the 2s and 2p states, and so it has a more complicated 
temperature dependence. A variety of detailed numerical calculations of the 
effective rate of recombination to the 2s and 2p states can be fit 


8S. P. Goldman, Phys. Rev. A40, 1185 (1989). Peebles used an earlier value, 8.227 sv], given by 
L. Spitzer and J. L. Greenstein, Astrophys. J. 114, 407 (1951). 

°w. J. Boardman, Astrophys. J. Suppl. 9, 185 (1964). 

10sec, e.g., QTE, Vol. I, p. 157. 
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with the simple formula:!! 


1.4377 x 107!° Oem s~! 


as 1 + 5.085 x 10-3 79.5300 (2.3.31) 


The value of £ is given in terms of «œ by Eq. (2.3.11): 


= (as 
Xan? 


3/2 
) exp(—Bo/kgT) a = 2.4147 x 10’? cm~? T3/? @ P44 Fy, 
(2.3.32) 


Finally, the function S[T] is given by Eq. (2.3.8). 

The values of X[T] calculated!? from the differential equation (2.3.27) 
with the inputs (2.3.28)-(2.3.32) (and Eq. (2.3.8) for S[T]) are given in 
Table 2.2 for Qyh? = 0.15 and a range of values of Qgh?. The initial 
condition here is taken to be that X[T] is given by the solution of the 
Saha equation (2.3.6) for thermal equilibrium at the highest temperature 
considered, which we take to be T = 4,226 K (i.e. z = 1, 550), high enough 
so that thermal equilibrium should be a good approximation (because the 
equilibrium value of X is very close to one, while the true value must be 
between one and the equilibrium value), but low enough so that hardly any 
of the helium is still ionized. (Almost all helium was doubly ionized until 
the temperature dropped below 20,000 K, and there was still appreciable 
singly ionized helium until the temperature dropped below about 4,400 K.) 
The actual value of X [4226] is slightly higher than the equilibrium value, 
but to three significant figures there would be no effect on the results at 
lower temperatures if we increased the assumed value of X [4226] by a small 
amount, as say from 0.984 to 0.99 for Qgh? = 0.02. 

Comparison of Tables 2.1 and 2.2 shows that for the small values of 
Ngh? considered here, the Saha equation stopped giving a good approxi- 
mation to the fractional ionization as soon as the equilibrium ionization 
dropped appreciably below unity. In particular, although in equilibrium 
the fractional ionization would have dropped to vanishingly small values 
for temperatures below 2,000 K, the ionization calculated from Eq. (2.3.27) 
leveled off at low temperatures to a small but non-zero asymptotic value, 


Ip, Péquignot, P. Petijean, and C. Boisson, Astron. Astrophys. 251, 680 (1991). This includes an 
over-all “fudge factor” of 1.14 recommended by S. Seager, D. D. Sasselov, and D. Scott, Astrophys. J. 
523, L1 (1999). Eq. (2.3.31) agrees well with subsequent calculations of D. G. Hummer, Mon. Not. Roy. 
Astron. Soc. 268, 109 (1994). 

121 thank D. Dicus for the calculation of these numerical results. The calculation is stopped at z = 10, 
because at later times hydrogen is reionized by the first generation of stars, and also the rate of expansion 
is affected by the vacuum energy. 
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Table 2.2: Hydrogen ionization X calculated from Eq. (2.3.27) and time ¢ for the temp- 
erature to drop to T from 10°°K calculated from Eq. (2.3.26). The fourth through sixth 
columns give results for various values of Q gh?, with Qyh? = 0.15. The last column gives 
results for parameters Rgh? = 0.02238, Qyl? = 0.13229 used in Section 7.2 to compare 
the analytic calculation of microwave background anisotropies with a numerical calculation. 


z TW) t(yrs) X95h2=0.01  Xogk=0.02 Xagn=0.03  XSec.7.2 
1550 4226 202,600 0.992 0.984 0.977 0.982 
1500 4090 213,200 0.976 0.958 0.943 0.954 
1450 3954 225,900 0.935 0.902 0.878 0.895 
1400 3818 239,800 0.861 0.815 0.780 0.805 
1350 3681 255,200 0.759 0.703 0.659 0.690 
1300 3545 272,000 0.645 0.580 0.529 0.564 
1250 3409 290,600 0.526 0.456 0.402 0.437 
1200 3273 311,300 0.409 0.339 0.289 0.321 
1150 3136 334,600 0.299 0.236 0.194 0.220 
1100 3000 360,400 0.205 0.154 0.122 0.142 
1050 2864 389,600 0.129 0.0928 0.0721 0.0846 
1000 2728 422,600 0.0752 0.0520 0.0396 0.0470 
950 2591 460,500 0.0405 0.0270 0.0203 0.0243 
900 2455 503,600 0.0210 0.0136 0.0101 0.0121 
800 2183 611,400 0.00662 0.00387 0.00276 0.00339 
700 1910 761,300 0.00319 0.00174 0.00120 0.00150 
600 1638 977,700 0.00203 0.00107 0.000731 0.00920 
500 1365 1.312 x 10° 0.00147 0.000762 0.000517 0.000653 
400 1093 1.872 x 10° 0.00114 0.000585 0.000395 0.000499 
250 684 3.922 x 10° 0.000829 0.000423 0.000285 0.000361 
100 275 1.604x 107” 0.000632 0.000321 0.000216 0.000273 

50 139 4.535x 107 0.000579 0.000294 0.000197 0.000250 

10 30.0 4.568 x 108 0.000537 0.000272 0.000183 0.000231 
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Figure 2.2: The fractional ionization given by the rate equation (2.3.17) as a function of 
temperature in degrees Kelvin, for Qyh2 = 0.132, Rgh? = 0.0224. 


due to the increasing rarity of encounters of the few remaining free pro- 
tons and electrons. This residual ionization played an important role in the 
formation of the first stars. 

The fractional ionization given by Eq. (2.3.17) for a currently favored set 
of cosmological parameters is also shown in Figure 2.2. On the scale of this 
figure, the results are indistinguishable from those given by the more elab- 
orate calculations of Seager, Sasselov, and Scott,? except for temperatures 
above T > 4, 300 K, where the contribution of electrons from the ionization 
of helium (ignored in Eq. (2.3.17)) was still significant. 

The most important application of these results for the fractional ion- 
ization is the calculation of the opacity O(T), the probability that a photon 
present at a time t(T) when the temperature is T will undergo at least one 
more scattering before the present, given by 


t 
O(T) = 1 — exp -J i COT Nelt) dr . (2.3.33) 
(T) 


This rises from near zero at low temperature, where the integral in the expo- 
nent is small, to near one at high temperature, where the integral becomes 
large. 
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It is convenient to convert the integral over time to an integral over 
temperature, again using Eq. (2.3.26): 


T 
O(T) = 1 — exp E or f ne(T") ar Ht" | (2.3.34) 
2.725 

We use Eq. (2.3.28) for H and take ne = Xn with n given by Eq. (2.3.29), so 


T 
O(T) = 1 — exp |- or f ne(T") ar HVT" | 
2.725 


ow l [ 0.01168 Qgh2 T! X(T’) dT’ 
— — ex — 
2 


. (2.3.35) 
725 /Qyh2 + 1.523 x 1075 T’ | 


In studies of anisotropies in the cosmic microwave background we are par- 
ticularly interested in when the photons observed today were last scattered. 
The probability that the last scattering of a photon was before the temper- 
ature dropped to T is 1 — O(T), and the probability that the last scattering 
was after the temperature dropped further to T — dT is O(T — dT), so the 
probability that the last scattering ofa photon was at a temperature between 
T and T — dT is 


1 — (1 — O(T)) — O(T — dT) = O'(T)dT . (2.3.36) 


The opacity function O(7) increases monotonically with temperature from 
O = Qat T = To to O —> 1 for T > œ, so O'(T) is a positive normalized 
probability distribution, with unit integral. Values!? of O'(T) for Qyh? = 
0.15 and a range of values of Qgh? are given in Table 2.3. The distribution 
O'(T) is peaked at a temperature Tz, with a standard deviation o, given in 
the two bottom rows of Table 2.3. 


x k k 


It may be useful to note an approximation!? for the fractional ionization, 
even though it was not used in the calculations of Tables 2.2 or 2.3. In equi- 
librium the fractional ionization depends only on the temperature and Qh”. 
But as soon as the Saha equation stopped giving a good approximation to 
the fractional ionization, the ionization then depended not only on Qgh’, 
but also on Qyh?. The approximation derived below shows that at suf- 
ficiently low temperatures the fractional ionization depends on Qgh? and 
Qyh? chiefly through a multiplicative factor (Qyh?)!/*/ Qgh, while the 
opacity O’(T) is nearly independent of these parameters. 


13B, J. T. Jones and R. F. Wyse, Astron. Astrophys. 149, 144 (1985). 
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Table 2.3: The normalized probability distribution O’(T) of the temperature of last 
scattering, as a function of the temperature T (in degrees Kelvin), calculated from 
Eq. (2.3.35). The second through fourth columns give results for various values of Qgh?, 
with Qyh? = 0.15. The last column gives results for parameters Qzh? = 0.02238, 
Qyh? = 0.13229 used in Section 7.2 to compare the analytic calculation of microwave 
background anisotropies with a numerical calculation. The bottom two rows give the 
parameters for a fit of O’(T) to the Gaussian exp[—(T — TL)? /20?]/o v27, found by 
setting Ty, equal to temperature at which O’ is a maximum, and 1/o/2z equal to the value 
of O’ at that maximum. 


/ / / 
TK) Ooph = 0.01 Oop = 0.02 Ooph = 0.03 OSe0.7.2 


4000  6.80x 107? =5.39x 10719 1.08 x 107! 5.75 x 1071! 
3500  2.26x 1074 4.12 x 1079 1.15 x 1075 2.53 x 1075 


3400 0.000451 0.000152 0.000069 0.000112 
3300 0.000759 0.000412 0.000262 0.000345 
3200 0.00109 0.000826 0.000664 0.000759 
3100 0.00132 0.00127 0.00118 0.00124 
3000 0.00139 0.00155 0.00157 0.00156 
2900 0.00127 0.00154 0.00164 0.00158 
2800 0.00102 0.00130 0.00142 0.00135 
2700 0.000746 0.000965 0.00107 0.00101 
2600 0.000502 0.000650 0.000721 0.000680 
2500 0.000320 0.000411 0.000455 0.000429 


2000  4.66x1075 5.16 x 1075 5.39 x 1075 5.25 x 1075 
1000  9.50x 107 9.76 x 1076 9.87 x 1076 9.84~x 1076 


TiK) 3017 2954 2930 2941 
o(K) 287 253 241 248 


To derive this approximation, note first that as soon as the fractional 
ionization dropped well below its equilibrium value, the term (1 — X)/S in 
the square brackets in Eq. (2.3.27) (which would equal X? in equilibrium) 
became much less than the term X*. Comparison of Tables 2.1 and 2.2 
shows that for a plausible range of cosmological parameters, this is the case 
for T < 3,400 K. At such temperatures we can also neglect the radiation 
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energy density compared with the mass density, so that the expansion rate 
(2.3.28) becomes simply 7.2 x 107!? /Qyh? s71. Also, for temperatures in 
the range from 3, 400 K down to 2000 K the effective 2p decay rate 3PT2p was 
less than the 2s decay rate, while for T < 2,000 K the reionization rate B was 
less than either 3PT2p or P25, so for all temperatures below T < 3,400 K, 
Eq. (2.3.27) took the form 


dX 
T” FCT (2.3.37) 
where 
BNT! Qh? 
T 1 = 5 2 (T 2.3.38 
e l £ É) T), (2.3.38) 
with g (T) a function of temperature alone 
842 T 701190 
T) = z. (233s 
ED = T5085 x 103 TOS 4422x 101 Tee aT: (23-39) 
The solution is 
i Oph? 3400 =l 
X(T) = | X38400) + —— r T^) dT’ 2.3.40 
(T) ~ | ¥ (3400) +o |, gT’) (2.3.40) 


Comparison of this formula with the results shown in Table 2.2 for Qm = 
0.15and Qg = 0.02 shows that the error introduced by these approximations 
rises to about 25% as the temperature drops from 3,400 K to ~ 2, 500 K, 
and then drops to less than 10% for T < 1,400 K. For temperatures less 
than about 2, 600 K the fractional ionization is so much less than at 3, 400 K 
that we can neglect the term X (3400)~! in the denominator of Eq. (2.3.36) 
without introducing any appreciable additional error, giving 


-1 


Q h2 1/2 3400 
X(T) ~ a / (T^ ar (2.3.41) 


so that X(T) is proportional at sufficiently low temperature to (Qyh?)!/2/ 
Qgh?, as was to be shown. Inspection of Table 2.2 confirms that for 
T < 1,700 K and with Qyh? = 0.15 fixed, the fractional ionization is 
indeed inversely proportional to Rgh? with fair accuracy, but this is not a 
good approximation in the neighborhood of 3,000 K. 

It is striking that the values of O’(T) depend only weakly on Qgh? for 
T < 3,200 K. The reason, as noted by Jones and Wyse,!? is that at temper- 
atures low enough to neglect the radiation energy density, Eq. (2.3.35) gives 
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In O(T) as proportional to Qgh?// Qyh? times an integral of T!2X (T), 
while the approximate formula (2.3.41) gives X(T) proportional at low 
temperature to y Qyh?/Q gh’, soat sufficiently low temperature O(T) and 
hence O'(T) are independent of Q gh? and also of Qyh. Table 2.3 shows 
that for plausible values of cosmological parameters, the temperature Tr 
of last scattering is always close to 3,000 K, with a spread (in the sense of 
standard deviation) of about 10%. 


2.4 The dipole anisotropy 


In the previous sections of this chapter we have treated the cosmic microwave 
background as perfectly isotropic and homogeneous. This is certainly a 
good approximation. Indeed, in the discovery of the cosmic microwave 
background in 1965, the one thing that enabled Penzias and Wilson to dis- 
tinguish the background radiation from radiation emitted by earth’s atmos- 
phere was that the microwave background did not seem to vary with direction 
in the sky. 

Of course, the cosmic microwave background does have small varia- 
tions in direction that are too small to have been detected by Penzias and 
Wilson. The departures from perfect isotropy provides some of the most 
important information we have about the evolution of the universe. This 
section deals with the simplest and earliest detected departure from isotropy 
of the observed cosmic microwave background, arising from the earth’s 
motion. The following two sections deal with anisotropies due to scat- 
tering of photons by intergalactic electrons in clusters of galaxies, and with 
the primary anisotropies left over from the early universe. 

To the extent that the cosmic microwave background is itself perfectly 
homogeneous and isotropic, it provides a frame of reference for the whole 
universe, with respect to which we can measure the peculiar velocities of 
individual galaxies. To analyze the measurement of our own galaxy through 
the cosmic microwave background, it is useful to consider the density N, (p) 
of photons in phase space, defined by specifying that there are N, (p) dp 
photons of each polarization (right or left circularly polarized) per unit 
spatial volume in a momentum-space volume d?p centered at p. Since |p| = 
hv/c and the momentum-space volume between frequencies v and dv is 
An h’v?dv/c?, Eq. (2.1.1) gives 


1 nr(clp|/h) E 1 1 


N, (p) = 24rh3v2/c3 WB exp (\plc/kT) — 1” 


(2.4.1) 


(The factor 1/2 takes account of the fact that ny includes both possible pho- 
ton polarization states.) This is of course the density that would be measured 
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by an observer at rest in the radiation background. The phase space volume 
is Lorentz invariant, and the number of photons is also Lorentz invariant, 
so N; is a scalar, in the sense that a Lorentz transformation to a coordinate 
system moving with respect to the radiation background that takes p to p’ 
also takes Ny, to No, where 


N, = Ny (p) - (2.4.2) 


If the earth is moving in the three-direction with a velocity (in units of c) of £, 
and we take p to be the photon momentum in the frame at rest in the cosmic 
microwave background and p’ to be the photon momentum measured on 
the earth, then 


pi 10 0 0 Pi 
p |_1{1010 0 P 
B| 00y By Py |? ee) 
|p| 00 By y IP'| 


where as usual y = (1 — 6?)~!/?. In particular 


ipl = (1 + 8 cos 6) ip'| (2.4.4) 


where 6 is the angle between p’ and the three-axis. Thus 


1 1 


N! (pA = = —, 2.4.5 
yO) apen] Ea) 
where the temperature is a function of the angle between the direction of 


the photon and the earth’s velocity 


T 
(ees (2.4.6) 


y(1 + B cos 6) l 


Since the galaxy can be expected to be moving at a velocity of several hun- 
dred kilometers per second, comparable to the peculiar velocities observed 
for other galaxies relative to the mean Hubble flow, and the solar sys- 
tem is moving with a similar velocity within the galaxy, we expect 6 to 
be roughly of order 1073, in which case y is essentially unity. The appar- 
ent temperature is greatest if we observe photons coming from the direc- 
tion toward which the earth is moving, for which cos@ = —1, where it is 
greater than the intrinsic temperature by a fractional amount Beaty. It is 
least if we observe photons moving in the same direction as the earth, for 
which cos @ = +1, and the temperature is decreased by the same fractional 
amount. 
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This effect was first observed in 1969 with a ground-based radiometer, 
but at the time it was only possible to measure the component of the earth’s 
velocity in the earth’s equatorial plane, found to be 350 km/sec in a direction 
corresponding to right ascension 11 h 20 m.! The full velocity vector of the 
earth was measured in 1977 by a Berkeley group,” using measurements from 
a U2 aircraft flying above most of the earth’s atmosphere. Our knowledge 
of this effect has been greatly improved by measurements from the COBE 
satellite. The Far Infrared Absolute Spectrophotometer group? found a 
maximum temperature increase AT of 3.372 + 0.014 mK (95% confidence 
level) in a direction with galactic coordinates* € = 264°.14 + 0°.30, b = 
48°.26 + 0°.30; the Differential Microwave Interferometer group? found 
AT = 3.353 + 0.024 mK (95% confidence level) in a direction with galactic 
coordinates £ = 264°.26 +0°.33, b = 48°.22+0°.13, corresponding to right 
ascension 11512™.2 + 0™.8, declination —7°.06 + 0°.16. 

More recently, the WMAP satellite experiment® (discussed in detail in 
Chapter 7) has given a maximum temperature increase of 3.346+0.017 mK 
in a direction £ = 263°.85+0°.1, b = 48°.25+0°.04. These results indicate a 
motion of the solar system with a velocity (0.00335)c/(2.725) = 370 km/sec, 
not quite in the direction of the Virgo cluster, which has £ ~ 284 and b ~ 74. 
For comparison, the rotation of the galaxy gives the earth a velocity relative 
to the center of the galaxy of about 215 km/sec, more or less in the opposite 
direction. Taking this into account gives a net velocity of the local group 
of galaxies’ relative to the microwave background of 627 + 22 km/sec in a 
direction (€ = 276° + 3°, b = 30° + 3°) between the Hydra and Centaurus 
clusters of galaxies. 

Expanding Eq. (2.4.6) in powers of 6, the temperature shift can be 
expressed as a sum of Legendre polynomials 


; p? 28? 
AT=T -T=T ie BP\(cos@) + =g algos 2) +... |. (2.4.7) 


Because 6 = 370 km/sec/c = 0.0013 is small, the temperature shift is 
primarily a dipole, but Eq. (2.4.7) also exhibits a “kinematic quadrupole” 


1E, K. Conklin, Nature 222, 971 (1969). 

2G. F. Smoot, M. V. Gorenstein, and R. A. Muller, Phys. Rev. Lett. 39, 898 (1977). 

3D. J. Fixsen et al., Astrophys. J. 473, 576 (1996). 

4The galactic coordinate b is the angle between the line of sight and the plane of our galaxy, so that 
the north galactic pole is at b = 90°; the galactic coordinate £ is the azimuthal angle around the axis of 
rotation of our galaxy, with the center of the galaxy at £ = 0°. 

5C. L. Bennett et al., Astrophys. J. 464, L1 (1996). 

6C. L. Bennett et al., Astrophys. J. Suppl. 148, 1 (2003). 

1G. F. Smoot, C. L. Bennett, A. Kogut, J. Aymon, C. Backus et al., Astrophys. J. 371, L1 (1991); 
A. Kogut, C. Lineweaver, G. F. Smoot, C. L. Bennett, A. Banday et al., Astrophys. J. 419, 1 (1993). 
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term that is not much smaller than the intrinsic quadrupole term in the 
temperature, to be discussed in Section 2.6. 


2.5 The Sunyaev—Zel’dovich effect 


There is another contribution to the anisotropy of the cosmic microwave 
radiation background, due to the scattering of this radiation by electrons in 
intergalactic space within clusters of galaxies along the line of sight. This 
is known as the Sunyaev-Zel’dovich effect.) Eq. (C.26) (the Kompaneets 
equation) of Appendix C shows that scattering of the cosmic microwave 
background by a non-relativistic” electron gas changes the observed photon 
occupation number N (w) (defined so that 4r w? N (w) dw is the number of 
photons of each of the two polarization states with energy between fiw and 
h(w + dw)) at photon energy iw « mec? at a rate (here in cgs units) 


4 3N (w) 


(62) 


0 
Tend = [kote 


Ñ (w) = 


4 
A + hot N(w) (1 +)) | 


where o7 is the Thomson scattering cross section, ne is the electron number 
density, and Tọ is the electron temperature. This formula can be used dir- 
ectly in calculating the rate of change of the occupation number N (œ) of a 
homogeneous isotropic photon gas through interactions with a Maxwell- 
Boltzmann distribution of electrons at temperature Te. In the applic- 
ation that concerns us now, we are interested instead in the change of the 
appearance to us of the cosmic microwave background due to scattering by 
a cloud of electrons along the line of sight. In this context, we rewrite the 
Kompaneets equation as 

neor ð 


3 

ae DE Me C2 w dw 

[kotoo roD + hoot Nw, £) (1 sos 0) | 
w 


(2.5.1) 


where £ is the proper distance coordinate along the line of sight through the 
cloud. 

The ionized plasma in clusters of galaxies is typically at temperatures 
greater than 10° degrees, so that kg Te is very much greater than the typical 


lYa. B. Zel'dovich and R. A. Sunyaev, Astrophys. Space Sci. 4, 301 (1969); R. A. Sunyaev and 
Ya. B. Zel’dovich, Comments Astrophys. and Space Physics 2, 66 (1970); 4, 173 (1972). 

2The ionized gas in galaxy clusters is hot, so that relativistic corrections, though small, are not 
negligible; see Y. Rephaeli, Astrophys. J. 445, 33 (1995); A. Challinor and A. Lasenby, Astrophys. J. 
499, 1 (1998); N. Itoh, U. Kohyama, and S. Nozawa, Astrophys. J. 502, 7 (1998). 
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value of the photon energy fiw, which for the cosmic microwave background 
is 1074 eV to 107° eV. In this case Eq. (2.5.1) simplifies to 


Ne(LoorkpTe(L) a Gi IN (a, 2] 


ð 
ap 8) = (2.5.2) 


Me C2 w dw Jw 


This is now a linear differential equation, so it can be solved exactly,? but in 
the usual case this is unnecessary, because the cloud of electrons is optically 
thin. Eq. (2.5.2) then tells us that the change in the occupation number due 
to passage of the radiation through the cloud is simply 


ð ƏN 
A EEE | (2.5.3) 
w dw Jw 
where y is the dimensionless parameter 
pa J dl ne(OkgTe(®) , (2.5.4) 
Mec 


the integral being taken along the line of sight through the cloud. For 
black-body radiation at temperature T, we have 


1 


N(o) = : (2.5.5) 
exp (ho/ksTy) -1 
Using this in Eq. (2.5.3) gives 
7 2 
AN=y ( us E zea) , (2.5.6) 
sinh“ (x /2) 


where x = fiw/kgT,. We see that in general the scattering changes the 
shape of the photon energy distribution, not just the effective tempera- 
ture. The characteristic dependence on w of Eq. (2.5.6) makes it possible 
in principle for radio astronomers to distinguish between anisotropies due 
to the Sunyaev—Zel’dovich effect and the primary anisotropies discussed in 
Section 2.6 and Chapter 7. 

But in the Rayleigh—Jeans part of the spectrum, where x < 1, Eq. (2.5.6) 
gives AN — —2y/x, while Eq. (2.5.5) gives N — 1/x, so the shape of 
the spectrum is preserved, with a fractional change in photon temperature 
equal to 

AT, AN 
T, N 


-2y . (2.5.7) 


3Ya. B. Zel'dovich and I. D. Novikov, Relativistic Astrophysics, Volume 2: The Structure and Evolution 
of the Universe, transl. L. Fishbone (University of Chicago Press, Chicago, 1983): Eq. (8.7.3). 
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It may seem surprising that scattering of photons by a cloud of electrons 
with much higher temperature would lower the photon temperature, but 
this is only in the Rayleigh—Jeans part of the spectrum. Far beyond the 
peak in the black-body spectrum, where x >> 1, Eqs. (2.5.5) and (2.5.6) give 
AN/N the positive value yx?; the cooling observed in the Rayleigh—Jeans 
region is due to the transfer of photons from low to high energy. Indeed, 
Eq. (2.5.3) shows that f AN (w)w~dw vanishes, so scattering by the electron 
cloud preserves the total number of photons we receive. 

Eq. (2.5.7) gives the fractional change in photon temperature observed at 
low frequency due to scattering in a cloud of electrons, typically associated 
with a cluster of galaxies. If the cluster is at a redshift z, then both T, and 
AT), are reduced by a factor (1 +z)~ l but the ratio is independent of z, and 
independent also of the Hubble constant and other cosmological param- 
eters, except in so far as these quantities are needed in estimating ne and 
proper lengths. The electron temperature T, can be measured from obser- 
vations of the luminosity of the cluster as a function of photon wavelength, 
with a result that is independent of the cluster redshift or Ho, but the proper 
length along the line of sight has to be inferred from the angular size of the 
cluster and the angular diameter distance d4, which is inversely proportional 
to Ho, so the electron density calculated from measurements of the Sunyaev— 
Zel'dovich parameter y is proportional to the value assumed for Ho. 

Measurements of the Sunyaev—Zel’dovich effect can usefully be com- 
bined with measurements of the X-ray luminosity of the cluster of galaxies.* 
The cross section? do for the bremsstrahlung of photons of energy between 
how and h(w + dw) (with fiw comparable to but not very close to mev? /2) in 
collisions between non-relativistic electrons of velocity v (with 1/137 « 
v < 1) and atomic nuclei of small velocity is proportional to v-*dw/a; 
the density of protons equals the density of electrons; and v is typically of 
order (kgT /me)'/?; so the rate at which an antenna of radius R receives 
bremsstrahlung photons of wavelength A in this energy range and in the res- 
olution solid angle (A/R)? from an optically thin plasma cloud with electron 
temperature T and number density ne is 


[R2/d2 a2 A/R] fu ludojn? x 2 (=) / dé T7 Pn? | 
(02) 
(2.5.8) 


Hence the value of ne inferred from measurements of the X-ray lumi- 


nosity and the angular size of the cluster is proportional to the assumed 


value of Hy! 2 The requirement that the value of ne inferred from the 


4J, Silk and S. D. M. White, Astrophys. J. 226, L103 (1978); A. Cavaliere, L. Danese, and G. De Zotti, 
Astron. Astrophys. 75, 322 (1979). For a review, see M. Birkinshaw, Phys. Rep. 310, 97 (1999). 

5See, e.g., V. B. Berestetskii, E. M. Lifshitz, and L. P. Pitaevskii, Quantum Electrodynamics, 2nd 
edition, transl. by J. B. Sykes and J. S. Bell (Pergamon, Oxford, 1982). 
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X-ray luminosity should agree with the value obtained from the Sunyaev— 
Zel'dovich effect thus allows an estimate of Hp. For example, study of two 
galaxy clusters with z ~ 0.55 (with d4 calculated assuming Qy = 0.3 
and Q, = 0.7) has given a Hubble constant of 6319 + 21 km/sec/Mpc, 
with the first uncertainty statistical and the second systematic, both with 
68% confidence.® Any such measurement depends so much on assumptions 
about the shape of the galaxy cluster and about the clumpiness of the elec- 
tron distribution, that it seems likely that measurements of the Sunyaev— 
Zel'dovich effect and X-ray luminosities will be more useful in providing 
information about galaxy clusters than in fixing the Hubble constant. In 
one respect, the Sunyaev—Zel’dovich effect is a nuisance — it will interfere 
with future measurements of the correlation between the primary temper- 
ature fluctuations at very small angular separation.’ But if combined with 
a model of structure formation, such as discussed in Chapter 8, observa- 
tions of the Sunyaev—Zel’dovich effect in small angle correlations of the 
microwave background temperature fluctuations can provide useful cosmo- 
logical information.’ Observations of such small-angle correlations’ have 
been interpreted as due to the Sunyaev—Zel’dovich effect. !° 


2.6 Primary fluctuations in the microwave 
background: A first look 


In the two previous sections we have considered anisotropies in the cos- 
mic microwave background that arise from effects in the recent universe: 
the motion of the earth relative to the cosmic microwave background, and the 
scattering of light by intergalactic electrons in clusters of galaxies along the 
line of sight. Now we turn to general anisotropies, including the highly 
revealing primary anisotropies that have their origin in the early universe.! 
It is convenient to expand the difference AT (A) between the microwave 
radiation temperature observed in a direction given by the unit vector ñ and 
the present mean value To of the temperature in spherical harmonics Y;"(n): 


ATA =T®-D=F am), Tos ef PATH, 2.6.) 
tm 


6E. D. Reese et al., Astrophys. J. 533, 38 (2000). 

TY, Rephaeli, Astrophys. J. 245, 351 (1981); S. Cole and N. Kaiser, Mon. Not. Roy. Astron. Soc. 
233, 637 (1988). 

ŠE. Komatsu and T. Kitayam, Astrophys. J. 526, L1 (1999). 

9B. S. Mason et al., Astrophys. J. 591, 540 (2003). 

10E, Komatsu and U. Seljak, Mon. Not. Roy. Astron. Soc. 336, 1256 (2002); J. R. Bond et al., 
Astrophys. J. 626, 12 (2005). 

1P. J. E. Peebles and J. T. Yu, Astrophys. J. 162, 815 (1970); R. A. Sunyaev and Ya. B. Zel’dovich, 
Astrophys. Space Sci. 7, 20 (1970); Ya. B. Zeľdovich, Mon. Not. Roy. Astron. Soc. 160, 1 (1972). 
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the sum over £ running over all positive-definite integers, and the sum over 
m running over integers from —£ to £. Since AT (7) is real, the coefficients 
dem Must satisfy the reality condition 


Agp = le —m - (2.6.2) 


(We are defining the spherical harmonics so that Yj”(n)* = Y,""(n).) As we 
saw in Section 2.4, the earth’s motion contributes to AT (A) an anisotropy 
that to a good approximation is proportional to P1 (cos 8) « y? (6, ġ) (with 
the z-axis taken in the direction of the earth’s motion), so the main aem 
produced by this effect is that with £ = 1 and m = 0. 

The coefficients agm reflect not only what was happening at the time of 
last scattering, but also the particular position of the earth in the universe. 
No cosmological theory can tell us this. The quantities of greatest cosmolog- 
ical interest are averages, which may be regarded either as averages over the 
possible positions from which the radiation could be observed, or averages 
over the historical accidents that produced a particular pattern of fluctu- 
ations. The ergodic theorem described in Appendix D shows that, under 
reasonable assumptions, these two kinds of average are the same. These aver- 
ages will be denoted (---). As discussed in Chapter 10, for anisotropies that 
arise from quantum fluctuations during inflation, it is these averages over 
historical accidents that are related to quantum mechanical expectation val- 
ues. We will return shortly to the question of how to use observations from 
one position in a universe produced by one specific sequence of accidents to 
learn about these averages, but first we must introduce some notation. 

We assume that the universe is rotationally invariant on the average, so all 
averages (AT (7) AT (M2) AT (m3) ---) are rotationally invariant functions 
of the directions 7, 72, etc. In particular, (A7(n)) is independent of À, 
Since AT (7) is defined as the departure of the temperature from its angular 
average, its angular average f AT (ñ)d?ñ/4m vanishes. Averaging over the 
position of the observer, we have f (AT dà = 0, so since (AT (fi) is 
independent of fi, it too vanishes. 

The simplest non-trivial quantity characterizing the anisotropies in the 
microwave background is the average of a product of two ATs. Rotational 
invariance requires that the product of two as takes the form 


(demae'm') = Seem -m Ce , (2.6.3) 
for in this case the average of the product of two ATs is rotationally 
invariant: 


2£+ 1 
4r 


) Pe- À), 
(2.6.4) 


ATAATA) =Y AYLAY M= C ( 
lm £ 
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where Py is the usual Legendre polynomial. Given the left-hand side, we 
can find C; by inverting the Legendre transformation 


1 
CoS ag J Pada Pe- A (ATHAT®H)) . (2.6.5) 


Instead of Eq. (2.6.3), we could equivalently define the multipole coefficients 
Ce by 
(dem@ pry) = ôe mm Ce , 

which shows that the C, are real and positive. For perturbations AT that 
are Gaussian in the sense described in Appendix E, a knowledge of the Ce 
tells us all we need to know about averages of all products of ATs. 

Of course, we cannot average over positions from which to view the 
microwave background. What is actually observed is a quantity averaged 
over m but not position: 


Ce = TES Diem a mafea PA Pula ATAATA) . 
(2.6.6) 


The fractional difference between the cosmologically interesting Cz and the 
observed govs is known as the cosmic variance. Fortunately, for Gaussian 
perturbations, the mean square cosmic variance decreases with £. The mean 
square fractional difference is 


2 
C= CoP 1 
Gee =) a ene te Rl 
£ mm 
(2.6.7) 


If AT is governed by a Gaussian distribution, then so are its multipole 
coefficients ag, (but not quantities quadratic in the agm, such as Cy.) It 
follows then that? 


(ae m4¢—m Ae nl Ae Sip) = (dem Ae —m) (ae m ag m) 
+ (aemaem) (ae —m ae m) 


+(aemae—m') (ae-maemw) (2.6.8) 


2Non-Gaussian terms in the probability distribution of anisotropies would show up as non-vanishing 
averages of products of odd numbers of the ag m, as well as corrections to formulas like Eq. (2.6.8) for 
the averages of products of even numbers of the agm. Such non-Gaussian terms are produced both in 
the early universe and at relatively late times. For a review, see N. Bartolo, E. Komatsu, S. Matarrese, 
and A. Riotto, Phys. Rep. 402, 103 (2004). Non-Gaussian terms produced by quantum fluctuations 
during inflation were calculated in the tree graph approximation by J. Maldacena, J. High Energy Phys. 
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Using Eq. (2.6.3), we find that the first term on the right-hand side of 
Eq. (2.6.8) contributes (2€ + 1c? to the sum in Eq. (2.6.7), while the 
second and third terms each contribute (22 + 1)C i to the sum, so that 


(2.6.9) 


This sets a limit on the accuracy with which we can measure Cọ for small 
values of £. On the other hand, the same analysis shows that for £ £ V, 


Ce — CS) (Ce — cgs 
Ce Co 


=0, (2.6.10) 


so the fluctuations of coe away from the smoothly varying quantity Ce 
are uncorrelated for different values of £. This means that when Cor is 
measured for all £ in some range A£ in which Cẹ actually varies little, the 
uncertainty due to cosmic variance in the value of Ce obtained in this range 
is reduced by a factor 1//A€. Even so, measurements of Cy for £ < 5 
probably tell us little about cosmology. Also, measurements for £ > 2, 000 
are corrupted by foreground effects, such as the Sunyaev—Zel’dovich effect 
discussed in the previous section. Fortunately there is lots of structure in 
C at values of £ between 5 and 2,000 that provides invaluable cosmological 
information. 

The primary anisotropies in the cosmic microwave background arise 
from several sources: 


1. Intrinsic temperature fluctuations in the electron—nucleon—photon 
plasma at the time of last scattering,’ at a redshift of about 
1,090. 


05 (2003) 013. The effect of loop graphs is considered by S. Weinberg, Phys. Rev. D 72, 043514 
(2005) [hep-ph/0506236]; Phys. Rev. D. 74, 023508 (2006) [hep-ph/0605244]; K. Chaicherdsakul, Phys. 
Rev. D 75, 063522 (2007) [hep-th/0611352]. For late-time contributions, see M. Liguori, F. K. Hansen, 
E. Komatsu, S. Matarrese, and A. Riotto, Phys. Rev. D 73, 043505 (2006) [astro-ph/0509098]. The 
weakness of microwave background anisotropies indicates that any non-Gaussian terms are likely to be 
quite small. So far, there is no observational evidence of such terms. 

3Strictly speaking, in the approximation of a sudden drop in opacity at a fixed temperature Tr ~ 
3,000 K, it is not the intrinsic fluctuation in temperature we observe, but the consequent fluctuation in 
the redshift zz of last scattering. Since the unperturbed temperature T(t) goes as 1 /a(t), the value az, of 
a(t) at which the total temperature 7(¢) +57 (t) reaches a fixed value Tz is shifted by an amount 6az, such 
that — (Tz /a)ôar +êT (tL) = 0. The observed temperature (leaving aside other effects) is Tr a(tz)/ag, 
so to first order this is shifted by a fractional amount Tz ôar /aoTo = ôT (t)ar /aoTo = 5T(tz)/T_L, 
just as if it were the intrinsic temperature fluctuation that we observe. 


138 


2.6 Primary fluctuations in the microwave background: A first look 


2. The Doppler effect due to velocity fluctuations in the plasma at last 
scattering. 


3. The gravitational redshift or blueshift due to fluctuations in the 
gravitational potential at last scattering. This is known as the Sachs- 
Wolfe effect. 


4. Gravitational redshifts or blueshifts due to time-dependent fluctuations 
in the gravitational potential between the time of last scattering and 
the present. (It is necessary that the fluctuations be time-dependent; 
a photon falling into a time-independent potential well will lose the 
energy it gains when it climbs out of it.) This is known, somewhat 
confusingly, as the integrated Sachs—Wolfe effect.4 


A proper treatment of these effects requires the use of general relativity. This 
will be the subject of Chapters 5—7. On the other hand, from the time the 
temperature dropped below 104 K until vacuum energy became important at 
aredshift of order unity, the gravitational field of the universe was dominated 
to a fair approximation by cold dark matter, which can be treated by the 
methods of Newtonian physics. Therefore in this introductory section we 
will concentrate on the Sachs—Wolfe and integrated Sachs—Wolfe effect, 
which turn out to dominate the multipole coefficients Ce for relatively small 
£, less than about 40. We will make only a few tentative remarks in this 
section about the contribution of intrinsic temperature fluctuations and of 
the Doppler effect. 

In considering the Sachs—Wolfe and integrated Sachs—Wolfe effects, we 
return to the Newtonian approach to cosmology outlined in Eqs. (1.5.21)— 
(1.5.27). The treatment of perturbations to a homogeneous isotropic 
cosmology in this approach is presented in Appendix F. For the moment, 
we need only one result of this analysis, that the perturbation to the 
gravitational potential, when expressed as a function of the co-moving 
coordinate x, is a time-independent function 5¢(x). This perturbation has 
two effects. First, there is the usual gravitational redshift: A photon emitted 
at a point x at the time of last scattering will have its frequency and hence its 
energy shifted by a fractional amount 5¢(x), so the temperature seen when 
we look in a direction 7 will be shifted from the average over the whole sky 
by an amount 


Gre 


F ) = ô Âr) . (2.6.11) 


4R. K. Sachs and A. M. Wolfe, Astrophys. J. 147, 73 (1967). 
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Here rz is the radial coordinate of the surface of last scattering, given by 
Eq. (1.5.44) as 


; 1/2 dx 
L= Pale: ha sinh Qg 7 5 s 
Qg Hoa(to) 1/+z2) Vax + QKx* + MX + QR 
(2.6.12) 


where Qx = 1—94 —Qy — ÊR; zL X 1090 is the redshift of last scattering; 
and fg is the present. The perturbation to the gravitational potential also has 
the effect of changing the rate at which the universe expands by a fractional 
amount 6@(x), and since the temperature in a matter-dominated universe 
is falling like a~! œ t~7/>, this shifts the value of the redshift at which the 
universe reaches the temperature ~ 3,000 K of last scattering in direction 
n by a fractional amount 


( êz ) = (=) » È) 
1+2/) T=3000 K a(t) J T=3000 K a} T=3000 K 


2 
ôb (rLÂ)tL = 3G OLA) 


With all other effects neglected, the temperature observed in direction 7 
would be 3,000 K divided by 1 + z, so this fractional shift in 1 + z changes 
the observed temperature by a fractional amount 


ATÀ) 2 5 
= —-ô : 2.6.13 
( Th ), 3 p(nr) (2.6.13) 
The sum of the fractional shifts (2.6.11) and (2.6.13) gives the Sachs—Wolfe 
effect: Nae i 
( 2) = oD (2.6.14) 


The factor 1/3 will be obtained in Chapter 7 as a result of a better-grounded 
relativistic treatment. 
It is convenient to write ô¢ (x) as a Fourier transform 


80(x) = J d>q eddy . (2.6.15) 
We make use of the well-known Legendre expansion of the exponential 


eax = XO (U Di Pe À) je(qr) , (2.6.16) 
£ 


where jẹ is the spherical Bessel function, defined in terms of the usual 
Bessel function J,(z) by je(z) = (7/22)? Je+1/2(2). Eq. (2.6.14) then 
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gives 


( ATÀ) 
To 


1 CO 
) zo A E Di fay paje (Gre) Pe- (2.6.17) 
SW t=0 


Now we must consider how to calculate the average of a product of two 
of these fractional temperature shifts in two different directions. Although 
5@(x) depends on position, the probability distribution of 5¢(x) as seen 
by observers in different parts of the universe is invariant under spatial 
rotations and translations. This implies among other things that 


(5bq ôy) = Pag) EA +q’). (2.6.18) 


where Pg (q) is a function only of the magnitude of q. (The delta function is 
needed so that (5¢(x)d@(y)) should be only a function of x — y.) Because 
6¢(x) is real, its Fourier transform satisfies the reality condition ôq = d¢-q, 
which together with Eq. (2.6.18) tells us that Pg(q) is real and positive. 

Now, using Eqs. (2.6.17) and (2.6.18), together with the reflection prop- 
erty Pe(—z) = (—1)'Pe(z) and the orthogonality property of Legendre 
polynomials 


ie. i oe 4r RE 
J aoar (à -) Py (à -ĝâ) = (=) See Pe (â -â') 5 (2.6.19) 


we have 


(LATA ATAN sw 


1) Pe- of q dq PADJA), 
(2.6.20) 


£ 
or, comparing with Eq. (2.6.4), 


16x’ T 


[0,0] 

Cosy [ Par Poa@itarn) e62 

To the extent that the gravitational potential is produced by pressureless 
cold dark matter, the differential equation for 6¢ does not involve gradients, 
and so the differential equation for its Fourier transform does not involve 
the wave vector q. (See Eqs. (F.12) and (F.18).) The dependence of dd 
on q then can arise only from the initial conditions for these differential 
equations.’ It is therefore natural to try the hypothesis that the function 


>This is not true even for the Sachs—Wolfe effect if q is so large that g/a became greater than H 
before the density of the universe became dominated by cold matter. As we will see in Chapter 7, this 
qualification affects Cy gw only for £ > 100. 
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P¢(q) has a simple form, such as a power of q. This power is conventionally 
written as n — 4: 


Pla) = Nga" (2.6.22) 


where N s is a positive constant. Then we can use a standard formula: 


243 —n)P (¢ j 27t) 


[0,0] 
:2 n—2 
jis) 8” “ds = ——_______———— , (2.6.23) 
I re (42) r (e+2- 3) 
and find that for £ < 100, Eq. (2.6.21) gives 
16x2 4r (3 — nry "NTR T (¢ + 27+) 
Ce sw > (2.6.24) 


or? (4) r(e+2- 23) 


In particular, even before the discovery of primary fluctuations in the cos- 
mic microwave background there was a general expectation about the values 
ofn and N, based not on the microwave background, but on the large scale 
structure of matter observed relatively close to the present. The perturba- 
tion ôp in the total mass density is related to the Sachs—Wolfe effect through 
the Poisson equation, which gives 


a ?V75o = 4r Gôp , (2.6.25) 


with the factor a~*(t) inserted because it is X = a(t)x that measures proper 
distances. (See Eq. (F.12).) Expressing the Fourier transform of do in terms 
of the Fourier transform of 6¢, we find the correlation function of the density 
fluctuations to be 


(50 (x, NEP, r) = (4r Gada)? J dq qf Polg) ire) | 
(2.6.26) 


The use of this formula to measure Py is discussed in Chapter 8. For 
the present, it is enough to note that observations of the density correla- 
tion function long ago led to the expectation that Pg takes the so-called 
Harrison—Zel’dovich form® with n = 1, and that No © 1075. As we will see 
in Chapter 10, inflationary theories generally predict that n is close but not 


6E. R. Harrison, Phys. Rev. D1, 2726 (1970); Ya. B. Zel’dovich, Mon. Not. Roy. Astron. Soc. 160, 
1P (1972). 
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precisely equal to unity. For n = 1, we obtain a result that is scale-invariant, 
in the sense of being independent of rz: 


2Ar2 m2 
870 NST 


Ea ha (2.6.27) 


Ce, sw > 
This is why experimental data on the cosmic microwave background 
anisotropies is usually presented as a plot of €(€ + 1)Ce versus £. 

What about the other contributions to Ce? Pressure gradients are 
important in the dynamics of the photon—nucleon-electron plasma, so in 
estimating the contributions of the Doppler effect and intrinsic temperature 
fluctuations we must deal with differential equations in which the wave num- 
ber q enters in an important way. Whatever sort of perturbation we are 
considering, we can always write it as a Fourier integral like Eq. (2.6.15) 
and use Eq. (2.6.16) to express e’4"" as a series of Legendre polynomials 
P¢(q - à) with coefficients proportional to je(qrL). The integral over q is 
then dominated by values of q of order £/rz in the case £ > 1. (This is the 
most interesting case because Eq. (2.6.9) shows that it is only for £ >> 1 that 
cosmic variance can be neglected in measurements of Ce.) This is because 
for £ >> 1, the spherical Bessel function je (z) is peaked at z ~ £. Specifically, 
for v = €+1/2 — coandz — oo with v/z fixed at a value other than unity, 
we have 


0 Z<v 
me) > z—1/2(z2 — p2)—!/4 cos (v2 — v? — varccos(v/z) — z) z>v. 
(2.6.28) 


Hence Cy for large £ chiefly reflects the behavior of the Fourier components 
of perturbations for q ~ £/rz. To put this another way, the physical wave 
number at time tz is kz = q/a(tz) (because it is a(tz)x that measures proper 
distances at this time) so Cy for large £ reflects the behavior of the perturba- 
tions fork; ~ €/d,4, where d4 is the angular diameter distance of the surface 
of last scattering 


1 
d4 = rra(tL) = 2 
Qg Hol + Zz) 
l d 
x sinh ae a , 
rep Vaxt + QKX? + Qux + NQR 
(2.6.29) 


with 1 +z, = a(to)/a(tz). 
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For physical wave numbers q/a that are much less than the expansion 
rate H the differential equation governing any one perturbation is essentially 
independent of the wave number q, so that the whole dependence of the 
perturbation on g comes from the initial conditions. (Ratios of different 
perturbations, such as 5¢, and ôpq, may of course depend on q.) Such 
perturbations are said to be “outside the horizon” because the wavelength 
2xa/qis much larger than the horizon distance (strictly speaking, the “par- 
ticle horizon” distance), which we saw in Section 1.13 is roughly of order 
1/H. During the radiation- or matter-dominated eras g/a decreased like 
t—!/2 or t-2/3, while H decreased faster, like 1/t, so perturbations that were 
outside the horizon at early times subsequently came within the horizon, 
those with high wave number entering the horizon earlier than those with 
lower wave numbers. 

We need to be a little more precise about the horizon distance. At the time 
of last scattering the universe was largely matter dominated, so as shown in 
Section 1.13 the horizon distance at that time was approximately 2/H (tz). 
But this is the maximum proper distance that light could have traveled since 
the beginning of the present phase of the expansion of the universe. As 
we will see in Chapters 5 and 6, the dominant perturbations to the plasma 
of nucleons, electrons, and photons that are relevant to the anisotropies in 
the cosmic microwave background are sound waves, so we need to take into 
account the smaller speed of sound. During the era before recombination, 
when radiation and matter were in thermal equilibrium, the speed of sound 
was Uy = (dp Jp)! 2 where ôp and ŝp are infinitesimal variations in the 
pressure and density in an adiabatic fluctuation. In such a perturbation, the 
entropy per baryon remains unperturbed: 


1 
o= xa (É) +a (5) l 
NB NB 


where e€ is the thermal energy density, ng is the baryon number density, and 
p is the pressure. As we saw in Section 2.2, there are so many more photons 
than baryons that both «€ and p are dominated by radiation: 


1 
c=agT*, p= quel. 
and therefore for adiabatic perturbations 


êT ong 50B 


= = x (2.6.30) 
T ng PB 
where ppg is the baryonic mass density. This gives a sound speed 
4 1/2 
_[  4agT*8T/3 E 1 (26.31) 
” — \ Son + 4apT35T 3G R) Bi 
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where R = 3pg/4agT*. Since pg x a~? and T œ a~!, we have R « a, and 
hence dt = dR/HR, or in more detail, 


rE dR a RdR 
RHoV/ Qu (R/R) + QR(Ro/RY Ho Qu Re. /Req +R 


where Rag = QRRo/QY = 3QRQB/4Q YL, is the value of R at matter- 
radiation equality. The acoustic horizon distance is then 


d tL dt r tL dt 
n=a f ajsi+R) Lf RV30 +R) 


_ 2 In V1+Rr+/ReQ + Rr 
~~ AopV3RLQuy (1 + 21)3/2 1+ ./REQ 


where Rz = 398/49, (1+ zz) is the value of R at last scattering. Gradients 
become important when kz ~ 1/dy, and since in Ce the integral over wave 
number is dominated by kz ~ €/d4, gradients become important when £ 
reaches the value £y, where £y = d4/dy. Both d4 and dy are proportional 
to 1/Ho, so Ho cancels in the ratio. 

For a crude estimate of ly, we note that d4 is proportional to (I+zz)~!, 
while dy is proportional to (1+zz)~>/”, so £y is of the order of (1+z,)!/? = 
33. To get a closer estimate, we will take cosmological parameters to 
have sample values suggested by supernova observations and cosmologi- 
cal nucleosynthesis (discussed in Section 3.2): Qy = 0.26, Qa = 0.74, 
Ng = 0.043. This gives Rz = 0.62, Reg = 0.21, d4 = 3.38 Hy ‘(1 +z)! 
and dy = 1.16 Hy ‘(1 + z1)7°⁄, and hence £y = 2.9/1 FZL = 96. 

We can now estimate the relative magnitude of the contributions to Ce 
other than the Sachs—Wolfe effect: 


, (2.6.32) 


e Doppler effect: Like any vector field, the plasma velocity can be 
decomposed into a term given by the gradient of a scalar, plus a 
“vector” term whose divergence vanishes. Appendix F shows that 
the vector term decays as 1/a, so the dominant perturbations are 
compressional modes, for which the velocity is the gradient of a scalar. 
We can therefore expect that in the integral over wave numbers q for 
AT /To, the contribution of the Doppler effect will be suppressed for 
small wave numbers by a factor of order kzdy. We will see in Chapter 
7 that, because it is proportional to the vector kz, the Doppler effect 
contribution does not interfere with the contribution of the Sachs- 
Wolfe effect, so for a multipole order £ « £y, the contribution of the 
Doppler effect will be less than that of the Sachs—Wolfe effect by a 
factor of order [(€/d4)dy}* = €7/£7,. 
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Intrinsic temperature fluctuations: As we have seen in Eq. (2.6.30) the 
intrinsic fractional temperature fluctuation at the time of last scattering 
will be just one-third the intrinsic fractional perturbation in the plasma 
density. As discussed in Chapters 5 and 6, the particular perturbations 
that are believed to dominate outside the horizon are adiabatic in the 
further sense that the fractional perturbation in the plasma density is 
equal to the fractional perturbation ôp/p in the total matter density. 
But the perturbation to the total matter density is related to the per- 
turbation to the gravitational potential by Poisson’s equation (2.6.25), 
which if evaluated at the time of last scattering gives for the Fourier 
transform: 


2 2 


q L 
= ———— ô 
$a 4r G 


PUR STT 


dy > 


where, as before, k = g/a(tz). Also, the mean total mass density o(tz) 
at last scattering is related to the horizon size dy by 


3H? (t) 1 
BnG = AnGdj, ’ 


PL) = 


so the order of magnitude of the intrinsic fractional temperature pertu- 
rbation is related to the perturbation to the gravitational potential by 


dT (tr) _ dp (tL) 
T(t) 38L) 


Thus we expect that in the integral over wave numbers q for A T / To, the 
contribution of intrinsic temperature fluctuations will be suppressed 
for small wave numbers by a factor k? d3}. The interference of this 
contribution with the Sachs—Wolfe term then makes a contribution to 
Ce for £ & Lhorizon that, like the contribution of the Doppler effect, 
is smaller than the Sachs—Wolfe contribution by a factor of order 
[(¢/da)du¥ © C/Q rizon (It should be noted that the distinction 
between the Sachs—Wolfe effect and the effect of intrinsic temperature 
fluctuations depends on how the time coordinate is defined. The non- 
relativistic estimates made here correspond to what in the relativis- 
tic treatment of Chapters 5-7 would be the use of what are called 
Newtonian gauge coordinates.) 


~ kid} ôgg . (2.6.33) 


Integrated Sachs—Wolfe effect: As already mentioned, to the extent 
that 5¢ is truly time independent, there is no integrated Sachs—Wolfe 
effect. The blueshift caused by a photon falling into a time-independent 
gravitational potential well along the line of sight would be canceled by 
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the redshift caused when the photon climbs out of the well.’ In fact, the 
perturbation to the gravitational field is not strictly time-independent, 
both because radiation continues to make a non-negligible contribu- 
tion to the gravitational field for some time after last scattering, and 
also because at late times vacuum energy requires modifications to 
the Newtonian treatment presented in Appendix F. The late-time inte- 
grated Sachs—Wolfe effect chiefly affects Ce for £ less than about 10. 


It is reasonable then to assume that for 10 < £ < 50, the dominant 
contribution to Cy is from the Sachs—Wolfe effect. 

Apart from observation of the £ = 1 anisotropy due to the earth’s motion 
through the microwave background, the first detection of an anisotropy in 
the cosmic microwave radiation background was achieved by the COBE 
satellite in 1992.8 This experiment scanned the sky with two microwave 
antennae separated by 60°, at frequencies 31.5, 53, and 90 GHz near the 
minimum of emission from our galaxy and the maximum of the Planck 
distribution for 2.7 K. The 1992 data showed an rms fluctuation in the 
temperature with angle of 30+5 uK, with an angular distribution consistent 
with n = 1. After four years, values of Ce had been measured? with the 
same instruments at values of £ ranging from £ = 2 to £ = 40. For £ > 4 
the results were fit to the €-dependence given by Eq. (2.6.24), with the result 
that n = 1:13+03,; which is consistent with the value n = 1 for what is called 
a Harrison—Zel’dovich spectrum. This result is often written as 


- Arg? 
~ 5e(€ +1)’ 


with Q known as the quadrupole moment. Fitting the values of Ce for 
10 < £ < 40, the 1996 COBE results gave Q = 18+ 1.4K. Comparing 
Eq. (2.6.34) with (2.6.27), we see that 


27 Q —6 
No = | — = = (8.7 + 0.7) x 10™. 2.6.35 
$t=V an, ) x ( ) 


Surprisingly, the multipole coefficients for £ = 2 and £ = 3 were found to 
be much less than would be expected by extrapolation of Eq. (2.6.34) from 


Ci (2.6.34) 


7There is also an anisotropy produced by time-dependent fluctuations in the gravitational potential of 
cosmological inhomogeneities (such as concentrations of cold dark matter), known as the Rees—Sciama 
effect; M. J. Rees and D. W. Sciama, Nature 217, 511 (1968). This is expected to be quite small; see U. 
Seljak, Astrophys. J. 460, 549 (1996). For possible larger effects due to local structures, see A. Rakić, 
Syksy Räsänen, and D. J. Schwarz, Mon. Not. Roy. Astron. Soc. 363, L27 (2006) [astro-ph/0601445]. 

8G. F. Smoot et al., Astrophys. J. 396, L1 (1992) 

°C. L. Bennett et al., Astrophys. J. 464, L1 (1996) [astro-ph/9601067]; A. Kogut et al., ibid, L5 (1996) 
[astro-ph/9601066]; K. M. Górski et al., ibid, L11 (1996) [astro-ph/9601063]; G. Hinshaw et al., ibid, 
L17 (1996) [astro-ph/9601088]: E. L. Wright et al., ibid, L21 (1996) [astro-ph/9601059]. 
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the fit for £ > 4,!° as shown by the apparent absence of two-point correla- 
tions for angles greater than about 60°. The discrepancy with theory is even 
worse when the integrated Sachs—Wolfe effect is taken into account. This 
result has since been confirmed by observations with the WMAP satellite, !! 
to be discussed in Chapter 7. It is quite possible that this discrepancy is 
due to a combination of foreground contamination and cosmic variance, !? 
which according to Eq. (2.6.9) is 63% for £ = 2. 

We will have to wait to discuss calculations of Ce for large £, above the 
range of validity of Eq. (2.6.34), until Chapter 7, after we have developed 
the general relativistic theory of cosmological fluctuations in Chapters 5 
and 6. In Chapter 7 we will also come back to the observations over the 
past decade that have refined the COBE measurements and extended them 
to higher £. 


10G. Hinshaw et al., Astrophys. J. 464, L25 (1996) [astro-ph/9601061]. 

1D, N. Spergel et al., Astrophys. J. Suppl. 148, 175 (2003). 

12G. Efstathiou, Mon. Not. Roy. Astron. Soc. 346, L26 (2003) [astro-ph/0306431]; A. Slosar, U. 
Seljak, and A. Makarov, Phys. Rev. D 69, 123003 (2004) [astro-ph/0403073]; A. Slosar and U. Seljak, 
Phys. Rev. D 70, 083002 (2004); G. Hinshaw et al., Astrophys. J. Suppl. Ser. 170, 288 (2007). 
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3 
The Early Universe 


We have been exploring the period in the history of the universe when the 
radiation temperature dropped from a little over 104 K down to its present 
value of 2.725 K. We now want to look back to the era when the temperature 
was greater than about 104 K, well before the energy density in radiation fell 
below that of baryons and cold dark matter. We will carry the story back 
to when the temperature was above 10!° K, when electron—positron pairs 
were abundant and neutrinos were in equilibrium with these pairs, and even 
farther back, as far as our current knowledge of the laws of physics will 
take us. 


3.1 Thermal history 


We first want to work out the history of the falling temperature of the 
early universe. In this section we will look back only to a time when the 
temperature was between 104 K and 10!! K, which is low enough so that 
muon—antimuon and hadron—antihadron pairs were no longer being pro- 
duced in appreciable numbers. 

There are two circumstances that greatly simplify this task. The first is 
that the collision rate of photons with electrons and other charged particles 
during this era was so much greater than the expansion rate of the universe 
that the photons and charged particles can be assumed to have been in ther- 
mal equilibrium, with a common falling temperature. At sufficiently early 
times even the neutrinos and perhaps the cold dark matter particles were 
also in thermal equilibrium with the photons and charged particles; later, 
when no longer colliding rapidly with other particles, they can be treated 
separately as free particles. The other circumstance is that the number den- 
sity of baryons (or more strictly, the number density of baryons minus the 
number density of antibaryons) is so much less than the number density of 
photons that we can ignore the chemical potential associated with baryon 
number. Baryons will be put back into the picture in the following section. 
Also, because the electron/photon number ratio is so small now, it is rea- 
sonable to assume that the universe has always had a very small net lepton 
number density (the number density of leptons of all sorts minus that of 
antileptons) per photon. This means that even at temperatures of order 
10!° K, when electron—positron pairs were abundant and the energy density 
and pressure were not simply proportional to T4 and the entropy density 
was not simply proportional to T°, the entropy density, energy density, and 
pressure were functions s(T), o(T), and p(T) of the temperature alone. 


149 


3 The Early Universe 


(The possibility of a non-negligible net lepton number is discussed at the 
end of this section.) 

Before studying the history of the universe during this era we will have to 
take a brief look at the thermodynamics and statistical mechanics of this sort 
of matter, in thermal equilibrium with negligible chemical potentials. The 
condition of thermal equilibrium tells us that the entropy in a co-moving 
volume is fixed 

s(T)a* = constant . (3.1.1) 


The second law of thermodynamics says that any adiabatic change in a 
system of volume V produces a change in the entropy given by 


d(p(T) V) + pT) dV 


d(s(T)V) = T 


(3.1.2) 
Equating the coefficients of dV gives our formula for the entropy 
density 

p(T) + p(T) 
re. 
Also, equating the coefficients of V dT and using Eq. (3.1.3) give the law 
of conservation of energy: 


S(T) = (3.1.3) 


dp(T) 

T aT 7 p(T) +p(T). (3.1.4) 
(For instance, for radiation we have p(T) = o(T)/3, so Eq. (3.1.4) gives 
the Stefan—Boltzmann law p = agT* with ag a constant that cannot be 
determined from thermodynamics alone; Eq. (3.1.3) then gives the entropy 
density for radiation as s(T) = 4ag T?/3. This is why in we said in Section 
2.2 that the constant okg = 4agT?/3ng may be interpreted as the radiation 
entropy per baryon.) 

With equal numbers of particles and antiparticles, the number den- 
sity n(p)dp of a species of fermions (such as electrons) or bosons (like 
photons) of mass m and momentum between p and p + dp is given by 
the Fermi—Dirac or Bose-Einstein distributions (with zero chemical 
potential) 


An gp” 1 
,T)= , 3.1.5 
MUR (2h)? (—s ra, a 


where g is the number of spin states of the particle and antiparticle, and the 
sign is + for fermions and — for bosons. For instance, for photons g = 2 
(and of course m = 0), because photons have two polarization states and 
they are their own antiparticles, while for electrons and positrons g = 4, 
because they have two spin states and electrons and positrons are distinct 
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particle species. The energy density and pressure of a particle of mass m are 
given by the integrals! 


p(t) = f n(p, T) dp ,/p2 + m? , (3.1.6) 
0 


p 


3\/p2 Em 


The entropy density of this particle is then given by Eq. (3.1.3) as 


p(T) -f n(p, T) dp (3.1.7) 


1 i —— Pp 


In particular, for massless particles Eq. (3.1.6) gives 


T)= i An p dp ( 1 ) 
OUSE |. Ori eoor] 


=| gagT*/2 bosons 


TgapT*/16 fermions ’ Ci) 
and of course p(T) = o(T)/3 and s(T) = 40(T)/3T. In other words, each 
species and spin of massless fermions makes a contribution to the energy 
density, pressure, and entropy density that is just the same as for each polar- 
ization state of photons, except for an additional factor 7/8. 

During a period of thermal equilibrium, the variation with time of the 
temperature is governed by the equation (3.1.1) of entropy conservation and 
the Einstein field equation, with curvature neglected, 


a 82Gp(T) 
= . 3.1.10 
Combining these gives 
s'(T) dT 
=- + constant . 3.1.11 
J s(T)V 24r Go(T) ( ) 


(The minus sign is inserted in taking the square root of T?, to take account 
of the fact that the temperature decreases as time passes.) In particu- 
lar, during any epoch in which the dominant constituent of the universe 
is a highly relativistic ideal gas, the entropy and energy densities are 


lEq. (3.1.6) follows directly from the definition of n(p, T), and Eq. (3.1.7) can then be derived from 
Fq. (3.1.4). Or both Eqs. (3.1.6) and (3.1.7) can be obtained from Eqs. (B.41) and (B.43). 
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given by 
2 T? 
s(T)= Nast (3.1.12) 
T4 
p(T) = l (3.1.13) 


where WN is the number of particle types, counting particles and antiparticles 
and each spin state separately, and with an extra factor of 7/8 for fermions. 
Then Eq. (3.1.11) becomes 


| 3 1 
= |- = : 1.14 
t i6nGNag T? + constant (3 ) 


With this background, let us now start our history at a time when the 
temperature was around 10!! K, which is in the range m, > kgT > me. 
Even though it was too cold at this time for reactions like v, + e > u + ve 
or vr +e > T + ve, the u and t neutrinos and antineutrinos were kept 
in thermal equilibrium by neutral current reactions, like neutrino-electron 
scattering or et +e~ = v+d. Hence the constituents of the universe at this 
time were photons with two spin states, plus three species of neutrinos and 
three of antineutrinos, each with one spin state, plus electrons and positrons, 
each with two spin states, all in equilibrium and all highly relativistic, giving 


7 43 
Nata O a =i (3.1.15) 
so that Eq. (3.1.14) gives, in cgs units: 
t ae l + constant = 0.994 A i + tant 
-N ER = 0. sec | ——— constant . 
1722 Gag T? 1010K 
(3.1.16) 


For instance, with muons ignored and the mass of the electron neglected, it 
took 0.0098 sec for the temperature to drop from a value 10! K to 10!! K, 
and another 0.98 sec for the temperature to drop to 10!° K. 

At a temperature of about 10!° K neutrinos were just going out of 
equilibrium and beginning a free expansion. The weak interaction cross 
section for neutrino-electron scattering is roughly owk ~ (AGwk kge T)?, 
where Gy ~ 1.16 x 10~>GeV~? is the weak interaction coupling constant, 
and the factor A” is included to convert a quantity with the units (energy)~? 
to a quantity with the units (length)? of a cross section. (Recall that we are 
using units with c = 1.) The number density of electrons at temperatures 
above 10!° K is roughly given by ne © (kgT/f)%, so the collision rate of a 
neutrino with electrons or positrons at such temperatures is 


Ty = now, © G2 (keT) /h. 
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This may be compared with the expansion rate, which during the radiation- 
dominated era is of the order 


H x y GlkgT)t/b , 


with the factor A~’ included to convert a quantity with the units (energy) 
to a quantity with the units mass/length* of a mass density. The ratio of 
the collision rate to the expansion rate is thus 


Io AD 1/2 3 Pa 

A ~ Gy A/G) (kgTY > (ax) 
Hence neutrinos were scattered rapidly enough to remain in thermal 
equilibrium at temperatures above 10! K. This is just a little greater than 
me/kp, so for lower temperatures electrons and positrons rapidly disapp- 
eared from equilibrium, the collision rate dropped rapidly below Ga 
(kgT)>/h, and hence the ratio ,/H dropped rapidly below unity. The 
neutrinos then began a free expansion, in which (as we saw in Section 2.1) 
the number density distribution n, continued to keep the form (3.1.5), with 
a temperature T, « 1/a. 

At lower temperatures we must take into consideration the finite mass of 
the electron, so the temperature T of the electrons, positrons, and photons 
(which were still in equilibrium with each other) no longer fell as 1/a. On 
the other hand, the freely expanding massless neutrinos preserved a Fermi— 
Dirac momentum distribution,” with a temperature that continued to drop 
as 1/a. We must therefore now distinguish between the photon temperature 
T, and the neutrino temperature Ty. 

The entropy density of the photons, electrons, and positrons is 


4agT? 4 [(®4rp d, 2 
dp eae Í P e{ Primin 
0 


3 Th Ori 3p? +m 
1 
x 
exp (vr + mž/ksT) +1 
4agT? 
z < S(me/kpT) , (3.1.17) 


2 This is not exact; even at temperatures under 10!° K, the neutrino distribution was slightly affected 
by weak interaction processes, such as e7 + et —> v +0. See A. D. Dolgov, S. H. Hansen, and 
D. V. Semikoz, Nucl. Phys. B 503, 426 (1997); 543, 269 (1999); G. Mangano, G. Miele, S. Pastor, and 
M. Peioso, Phys. Lett. B534, 8 (2002). For a review, see A. D. Dolgov, Phys. Rep. 370, 333 (2002). The 
weak interactions provide some thermal contact between the neutrinos and the plasma, which is being 
heated by electron—positron annihilation, so the effect is to slightly increase the neutrino energy density, 
by an amount usually represented as an increase in the effective number of neutrino species, from 3 to 
3.04. This effect is neglected in what follows. 
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where, recalling that ag = 1°k4/ 15%, 


45 
S(x)=l+ 


oo 2 
2 2 2 y 1 
— y“ dy | y y4 + x4 + : 
274 Jo ( oa) exp /y?2+x2+41 


(3.1.18) 


The entropy conservation law (3.1.1) gives aT?S(me/kgT) constant, and 
since T, œ 1/a, this means that T, is proportional to TS!/3(m./kgT). The 
temperatures were equal for kgT >> me, and 


5) =1422= 11/4, (3.1.19) 


So 
1/3 
T,= (4/11) YT S'Bone/kpT) - (3.1.20) 


The ratio T/T, rose from very close to unity for T > 10!! K to 1.008 at 
T = 10!° K and to 1.346 at T = 10° K. To find the asymptotic value 
of T/T, without a computer calculation, we note that S(co) = 1, so for 
kgT «me, Eq. (3.1.20) gives 


1/3 
TITS (11/4) = 1.401. (3.1.21) 


In particular, at the present time, when T = 2.725K, the neutrino 
temperature is 1.945 K. Unfortunately there does not seem to be any way 
of detecting such a neutrino background. 

With three flavors of neutrinos and antineutrinos, the total energy 
density during this period is 


4 


T œ 4 2 d Po) 2 
p(T) = 5 EE aT +a | os aa 
8 o Oth)” exp (Jp? +mz/keT)+1 
= agT*E(me/kpT) , (3.1.22) 
where 
fA o Jy? + xd 
ae ees (=) StB) + = | NES. B10) 
8 H m Jo expyy?+x?+1 
We insert Eqs. (3.1.17) and (3.1.22) in Eq. (3.1.11), and find 
= J eo o 7 3) dT 
S(me/kgT) TV 240 GagT4E(me/kpT) 
= tf (3 ape =) E-l) x dx , (3.1.24) 
S(x) 
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where x = me/kgT , and in cgs units 
-1/2 
AE (24x G cag ms/Ks) = 4.3694 sec . (3.1.25) 


The values of 7/7, and of the time required for the temperature to fall 
to T (calculated from Eq. (3.1.11)) are given for various values of T in 
Table 3.1. 

After the era of electron—positron annihilation, the energy density of 
the universe was dominated for a long while by photons, neutrinos, and 
antineutrinos, all of them highly relativistic, so during this period we have 
s(T) x T3, and 


7 
p(T) = apT*+_-3-agT4 =agT* (1+ —-3- (4/114 ) = 3.363 agT*/2. 
8 8 


(3.1.26) 


Table 3.1: Ratio of electron-photon temperature T to neutrino temperature T, and the time 
t required for the temperature to drop from 10!! K to T, for various values of T. 


T (K) T/T, t(sec) 
10!! 1.000 0 
6 x 1010 1.000 0.0177 
3 x 10!0 1.001 0.101 
2x 10/9 1.002 0.239 
10!0 1.008 0.998 
6 x 10° 1.022 2.86 
3 x 10° 1.080 12.66 
2 x 10° 1.159 33.1 
10° 1.345 168 
3 x 108 1.401 1980 
108 1.401 1.78 x 104 
107 1.401 1.78 x 10° 
10° 1.401 1.78 x 108 
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That is, during this period the effective number of particle species is M = 
3.363. Using Eq. (3.1.14) then gives, in cgs units: 


—2 


3c? 1 T 
f= — T6. Taug T2 +constant = 1.78 sec kA +constant . 
(3.1.27) 


For instance, the time required for the universe to cool from a temperature 
of 10° K (where electrons and positrons have mostly annihilated) to a tem- 
perature of 108 K is 1.76 x 104 sec, or 4.9 hours. 

According to Eq. (3.1.27), the time required for the temperature to drop 
to 10° K from much higher values is 1.78 x 108 sec, or 5.64 years. At 
lower temperatures we must take into account the energy density of non- 
relativistic matter, and Eq. (3.1.27) no longer applies. We saw in Section 2.3 
that for Rmh? = 0.15, it took an additional 360,000 years for the universe 
to cool to the temperature 3, 000 K of last scattering. 


So far in this section we have been assuming that neutrinos are massless, 
and that the net neutrino number of each of the three types (that is, the 
number of neutrinos minus the number of antineutrinos) is much less than 
the number of photons. In the general case of an ideal gas of particles of 
mass m, the number n(p) dp of particles of momentum between p and p+ dp 
is given by the Fermi—Dirac and Bose-Einstein distributions 


An gp* dp 1 
rh)? \ exp[(/p2 + m2 — u)/kgT]+1 


where u is the chemical potential for the particle in question, a quantity 
that is conserved in any reaction occurring rapidly in thermal equilibrium, 
and again g is the number of spin states of the particle and antiparticle, and 
the sign is + for fermions and — for bosons. This reduces to Eq. (3.1.5) 
in the case of zero chemical potential, and it yields the number density 
(2.3.1) for non-relativistic particles with p « mand kgT « m. During the 
whole of the era of interest here, electrons and positrons rapidly annihilated 
into photons, so their chemical potentials were equal and opposite, and 
since we are assuming charge neutrality and neglecting the tiny number 
of baryons per photon, we can conclude that the chemical potentials of the 
electrons and photons were much less than kg7. At temperatures at which 
neutrinos and antineutrinos were in thermal equilibrium with electrons, 
positrons, and photons, reactions like et + e~ = v; + D; were occurring 
rapidly (where i = e, u, t label the three types of neutrino), so the chemical 


n(p, T, 1) dp = i (3.1.28) 
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potential u; of each type of neutrino was equal and opposite to the chemical 
potential of the corresponding antineutrino. But if we do not assume zero 
net neutrino (or lepton) number, then there is no a priori reason why the ui 
had to be less than kgT. 

If neutrino masses are less than about 1 eV, then they may be neglected at 
the temperatures of interest in this section. The observations of oscillations 
between different flavors of neutrinos from the sun, nuclear reactors, and 
cosmic rays shows? that the two differences in the squares of the masses of 
the three types of neutrinos of definite mass (which are mixtures of neutrinos 
of electron, muon, and tau flavor) are 8.0194 x 1075 eV? and between 1.9 


and 3.0 times 107? eV?. Thus the neutrino masses are all much less than 1 
eV, unless they are highly degenerate, which there is no reason to expect. If 
degenerate, then from the absence of anomalies in the low-energy beta decay 
of tritium, their common mass must be less than 2 eV. Whether degenerate 
or not, it is clear from this that all three neutrino types (if there are only 
three) have masses very much less than 1 MeV, and were therefore highly 
relativistic at the time that they went of thermal equilibrium with electrons 
and positrons, at a temperature of about 10!° K. Once out of equilibrium, 
their momentum simply decayed as | /a (as shown in Section 1.1), so if their 
chemical potential was negligible their momentum distribution remained 
the same as that of photons, with a temperature less by a factor (4/11)!/3. 
Thus once kg T dropped below the smallest neutrino mass, their energy den- 
sity became just n, `, m, = (3/11)ny, >¢,, m. (For kgT much larger than 
the mass, the integral for the number density of each spin state of fermions 
is 3/4 the corresponding integral for bosons, and after neutrinos decouple 
T3 = (4/11)T}.) With a non-zero chemical potential the energy density 
is larger. The agreement between theory (with massless neutrinos) and 
observation for the cosmic microwave background anisotropies discussed 
in Sections 2.6 and 7.2 and for the large scale structure discussed in Chapter 
8 shows that the sum of the three neutrino masses is less than 0.68 eV (95% 
confidence level),* so if they are degenerate the common mass is less than 
0.23 eV. This result has been contradicted by the observation of neutrinoless 
double beta decay in a Heidelberg—Moscow experiment,’ which suggests a 
value greater than 1.2 eV for the sum of neutrino masses. There has not 
yet been an opportunity to confirm the double beta decay results, and for 
the present it seems reasonable to continue to neglect neutrino masses. 


3W.-M. Yao et al. (Particle Data Group), J. Phys. G. 33, 1 (2006). 

4D. N. Spergel et al., Astrophys. J. Suppl. Ser. 170, 377 (2007) [astro-ph/0603449]. 

5H. V. Klapdor-Kleingroth, I. V. Krivosheina, A. Dietz and O. Chkvorets, Phys. Lett. B 586, 198 
(2004). 

6A. De La Macorra, A. Melchiorri, P. Serra, and R. Bean, Astropart. Phys. 27, 406 (2007) [astro- 
ph/0608351]. 
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With neutrino masses neglected, the energy density, pressure, and net 
lepton number density of neutrinos and antineutrinos of type i is 


pi = 3pi 


An 3 ( l 1 
= d 
aan | PP \ ol —ma/keT) +1 explo + ap/keT] +1 


aoe m Pty IkgT), (3.1.29) 


1 1 
á (e mi)/kgT]+1 expl(p +n)/kgT]+ i) 
er 


Om ‘Onn PiikeT), (3.1.30) 


where 
Pix) = f [E+ + e] ay, (3.1.31) 
0 


Moy = f [e= = (e= +1) 1] ydy. (3.1.32) 
0 


As we have seen, at temperatures above 10!° K the energy density of 
the photons and electron—positron pairs is 11 ag T4/4, and the pressure is 
one-third as great, so the total energy density and pressure are given by 


llag 4r ký 
4 rh)? 


p=3p =T" XO Plmi/kgT) | . (3.1.33) 


i=e,4,T 


The equation (1.5.20) of energy conservation tells us that under these 
circumstances pa‘ is constant, while the conservation of each type of neu- 
trino number also tells us that n;a? is constant. Since p and n; depend in 
different ways on the chemical potentials, this requires that as the universe 
expands in this era the u;/kgT remain constant, and also T œ 1/a, just as 
in the case of zero chemical potential. 

As the temperature dropped below 10!° K the temperature of the 
photons and electron—positron pairs no longer varied as 1 /a, but as we have 
seen the neutrinos and antineutrinos entered on a free expansion. With 
each neutrino’s momentum p varying as | /a, the form of the Fermi—Dirac 
distributions for each type of massless neutrino was preserved, with a tem- 
perature T, œ l/a and u;/kgT constant, just as before decoupling. We 
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conclude that to a good approximation T, œx l/a and each u; « l/a 
throughout the whole era of interest here. 

The only effect that a non-zero neutrino chemical potential would have 
on the calculations of this section is that it would increase the total energy 
density and hence shorten the time scale. For any non-zero u the function 
P(u/kgT)is greater than P(0) = 724/60, soif u; £ Othen p; > 7x? (kgT)4/ 
120(4)? = (7/8)agT*. In particular, if |u;| > kgT then P(ui/kgT) ~ 
(wi/kpT)*/4, so pi = m(ui)*/(2h)? >> agT*, and these neutrinos (or 
antineutrinos, if exp(—u;/kgT) is large) dominated the energy density of the 
universe, at least until the cross-over of non-relativistic matter and radiation. 
Inspection of Eq. (3.1.28) shows that for a chemical potential u >> kgT, 
the fermion distribution function n(p, T, u)dp is equal to 47 gpdp /(2h)? 
for particle energies less than u, and then falls off rapidly for higher ener- 
gies, indicating that all fermion energy levels up to energy u are filled, while 
higher energy levels are empty. This is the case of complete neutrino degen- 
eracy.’ Experiments on the beta decay of tritium, 7H —> *He + e~ + De 
show that |jz,| is less than a few eV, because otherwise for ue > O there 
would be a rise in the electron spectrum beyond the expected endpoint 
m(?H) — m@He), due to absorption of degenerate cosmic neutrinos in the 
reaction ve +7H —> 3He+e7, while for Ue < Othe Pauli exclusion principle 
would produce a dip in the electron spectrum within a few eV of its expected 
endpoint, where antineutrino energies are less than a few eV, putting them 
within the degenerate antineutrino sea. (The absence of a dip within a few 
eV of the expected electron spectrum endpoint also sets a limit of a few eV 
on the electron neutrino mass.) This yields an upper bound on the time- 
independent quantity |ue|/kgT of order 10*, which is very much weaker 
than the upper bound that will be provided in the following section by con- 
siderations of cosmological nucleosynthesis. 


3.2 Cosmological nucleosynthesis 


We have worked out the thermal history of the universe from temperatures 
above 10!° K down to the crossover temperature ~ 104 K, ignoring the 
presence of a small number of nucleons (and a small excess of electrons 
over positrons) Now let us consider what happens to the nucleons during 
this era. 


TThe possibility of cosmic neutrino degeneracy was raised by S. Weinberg, Phys. Rev. 128, 1457 
(1962). 

'T outlined the history of these calculations in The First Three Minutes (Basic Books, 1977, 1988). 
Briefly, the first calculations of cosmological nucleosynthesis were undertaken by Ralph Alpher, George 
Gamow, and Robert Herman in the late 1940s; see G&C, Chapter 15, footnotes 51 and 52. In this 
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The weak interactions allow neutron—proton conversion through six 
processes: 


n+vepte , nt+et=p4+0D, nẹæp+e +v. (3.2.1) 


(Here v is ve; the other neutrino flavors do not contribute to these reactions.) 
In this range of temperatures kgT « my, so the nucleons can be treated as 
essentially at rest. The initial and final lepton energies are therefore simply 
related, by 


E.-E,=Q forn+vepee , 
E,y-Ee=Q fornt+et=p+T (3.2.2) 
E,+E=Q forn=pt+e +7, 


where 
Q = Mn — mp = 1.293 MeV . (3.2.3) 


The total rates at which an individual neutron is converted to a proton or a 
proton to a neutron take the form 


1/2 
m (Q +q}? adq 


(Q+? (1 + ea/ksT») (1 i e—=(0+0)/keT ) í 
(3.2.4) 


an> p =a f 


work it was assumed that nucleons start as pure neutrons, which then convert to protons by the process 
of neutron beta decay. It was then pointed out that the conversion of neutrons into protons and vice 
versa occurs primarily through two-particle collisions, and that the rapid rate of these processes at very 
early times has the consequence that nucleons start as 50% neutrons and 50% protons, by C. Hayashi, 
Prog. Theor. Phys. (Japan) 5, 224 (1950). Following this, a modern calculation of the evolution of the 
neutron/proton ratio was presented by R. A. Alpher, J. W. Follin, Jr., and R. C. Herman, Phys. Rev. 92, 
1347 (1953), but the results were not applied to the problem of cosmological nucleosynthesis. Several 
authors noted that the abundance of helium in the universe is too large to be accounted for by stellar 
nucleosynthesis; see G. Burbidge, Pub. Astron. Soc. Pacific 70, 83 (1958); F. Hoyle and R. J. Tayler, 
Nature 203, 1108 (1964). The modern theory of cosmological nucleosynthesis is due to P. J. E. Peebles, 
Astron. J. 146, 542 (1966). (Related calculations done by Ya. B. Zel’dovich, Adv. Astron. Astrophys. 
3, 241 (1965) were not known in the West until much later.) Nucleosynthesis calculations were then 
extended to more nuclides and reactions by R. V. Wagoner, W. A. Fowler, and F. Hoyle, Astrophys. J. 
148, 3 (1967), and many small corrections were included by D. A. Dicus, E. W. Kolb, A. M. Gleeson, 
E. C. G. Sudarshan, V. L. Teplitz, and M. S. Turner, Phys. Rev. D 26, 2694 (1982). Modern reviews are 
given by G. Steigman, in Measuring and Modeling the Universe — Carnegie Observatories Astrophysics 
Series, Volume 2, ed. W. Freedman (Cambridge University Press, Cambridge, UK) [astro-ph/0307244]; 
in The Local Group as an Astrophysical Laboratory — Proceedings of the May 2003 STScI Symposium 
[astro-ph/0308511]; in Chemical Abundances and Mixing in Stars in the Milky Way and its Satellites — 
Proceedings of the ESO/Arcetrei Workshop, eds. L. Pasquini and S. Randich (Springer-Verlag) [astro- 
ph/0501591}; and Int. J. Mod. Phys. E15, 1 (2006) [astro-ph/0511534]. For discussions emphasizing 
analytic calculations, see G&C, Section 15.7, and V. Mukhanov, Int. J. Theor. Phys. 43, 669 (2004) 
[astro-ph/0303073]. 
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m \ (Q+4)°a°dq 
npn) =A f 1-7 ee 
(Q+4q) (1 P e~alksTv ) (1 e(Q+a/ksT ) 
(3.2.5) 
where? 
7 2, (1 + 324) cos? 0c Bee 
7 2n3h i = 
and the integrals over q run from —oo to +00, leaving out a gap from 
q = —Q — me tog = —Q + me, where the square root would be imaginary. 


These rates include the effect of the Pauli Principle in the presence of partly 
filled lepton seas. For instance, the cross section for the process n + et > 
ptvis 20h AE? /Ue, the number density of positrons of each helicity with 


momentum between pe and pe + dpe is 4x p2dpe(2h)-*| exp(Ee /kpT) + 


-1 
| , and the fraction of unfilled antineutrino levels with energy E, is 


1 — [exp(E,/kgT,) + 1]! = [exp(—-E,/kgT,) + 1]! 


so the total rate per neutron of the process n + et > p+ Vis 
[0,0] 
A(ntet > p+) =A / E?p?dpe [exp(Ee/kpT) + 17! 
0 


x [exp(—Ey/kgTy) +17! 


Changing the variable of integration to g = —E\, = —Q-— Ee, we see that this 
partial rate is just the part of the integral (3.2.4) that runs from q = —co to 
q = —Q—me. Likewise, the part of the integral that runs from q = —O+me 
to q = 0 is supplied by the neutron-decay process n > p + e~ + v, with 
q = —E,, and the part of the integral that runs from q = 0 to q = +00 
arises from the processn+v —> p+e-, with q = Ey. Similar remarks apply 
to the integral (3.2.5). 

With the rates (3.2.4)-(3.2.5) known in principle, we can calculate the 
change in the ratio X, of neutrons to all nucleons from the differential 
equation 

dX), 


rF —h(n > p)X;, +p > nl — Xn). (3.2.7) 


2Here Gy; = 1.16637(1) x 107>GeV~? is the weak coupling constant, measured from the rate of 
the decay process wt —> e+ + ve + 0u; g4 = 1.257 is the axial vector coupling of beta decay, measured 
from the rate of neutron decay; and Oç is the Cabibbo angle, with cos 8c = 0.9745(6), measured from 
the rate of O!4 beta decay and other 0+ — 0* transitions. 
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As a check, note that for a time-independent temperature T equal to Ty, 
the two rates (3.2.4) and (3.2.5) would have the ratio 
A(p > n) 


A(n —> p) TER ( O/ksT) for T= Ty. (3.2.8) 


It follows then that in this case, Eq. (3.2.7) would have the time-independent 
solution expected in thermal equilibrium 


Xn Xn 


ans eka ( = O/keT) l (3.2.9) 


It is the inequality of T and T, as well as the time-dependence of these 
temperatures that drives X„/Xp away from its equilibrium value (3.2.9). 

For kgT > Q we can evaluate the integrals (3.2.4)-(3.2.5) by setting 
T, = T and Q = me = 0, so in this case 


+00 


4d 
A(n > p) =p > n) = af (1 i TOE elke ) 


7T 4 5 -1 T \ 


For comparison, the time ¢ for the temperature to drop to T from much 


—2 
higher values is given by Eq. (3.1.16) as 0.99 sec (7/10!°K) _ Also, 
H ~ 1/2t. The ratio 4/H is therefore 


T 3 


This ratio is larger than 1 for T > 1.1 x 10! K. Itis true that temperatures 
near this lower bound are not much larger than Q/k, and T, at this epoch 
is not precisely equal to T, so the rates A(n —> p) and A(p — n) are not 
precisely equal, and neither is given precisely by Eq. (3.2.10). Nevertheless 
Eq. (3.2.10) gives the order of magnitude of these rates in this temperature 
range, so we can still rely on the conclusion that A(n > p)/H and A(p > 
n)/H are large down to these temperatures. This means that the initial value 
of X, at temperatures larger than about 3 x 10!° K is given by the condition 
that the right-hand side of Eq. (3.2.7) should vanish: 


A(p > n) 
=> : 
A(p > n)+A(n > p) 


If X,, were larger or smaller than this, then the right-hand side of Eq. (3.2.7) 
would be large and respectively negative or positive, so X;,, would be rapidly 


Xn 


(3.2.12) 
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driven to the value (3.2.12). For T > 10!° K the neutrino temperature 
was within 1% of the photon-electron-positron temperature, so at these 
temperatures the rates have the ratio (3.2.8). Thus at the temperatures 
T > 3 x 10!° K at which the neutron fraction is given by the equilibrium 
formula (3.2.12), this formula gives simply 


1 
~ T+exp(Q/kpT) ` 


It is crucially important that this ratio is fixed at high temperatures by the 
condition of zero lepton chemical potential, so that we do not need to make 
any a priori assumptions about the initial neutron/proton ratio. The results 
of a numerical integration of Eq. (3.2.7) with initial condition provided by 
Eq. (3.2.13) are presented in Table 3.2.3 

We have seen that for kgT > me the ratios A(p > n)/H anda(n > p)/H 
varied roughly as T°, so there was a rather sharp end to the equilibrium era, 
in which à >> H, at a temperature between 3 x 10!° K and 10!° K. A little 
later, at a temperature between 10! K and 3 x 10° K the two-body and three- 
body neutron—proton conversion reaction rates became negligible, due in 
part to the disappearance of electron—positron pairs. Neutron—proton con- 
version continued mostly through the process of neutron decay, with a mean 
lifetime t, of 885.7+0.9 sec, so the neutron fraction became proportional to 
exp(—t/t,). This is confirmed by fitting the numerical results presented in 
Table 3.2 with an exponential that decays with the observed rate of neutron 
decay, which gives 


(3.2.13) 


n 


X, > 0.1609 exp ( (3.2.14) 


=f 
885.7 =) 
The conversion of neutrons into protons was eventually stopped by the 
formation of complex nuclei, in which neutrons are stable. As a guide to 
this process, note that in thermal (and chemical) equilibrium the number 
density of a nuclear species i is given by formulas like Eqs. (2.3.1): 


. QnmikgT N? 
ni = gyelti/kBT (=) exp(—m;/keT) , (3.2.15) 


where m; is the mass of nucleus i and g; is the number of its spin states. If 
these nuclei can be built up rapidly out of Z; protons and A; — Z; neutrons, 


3These results are somewhat different from those given in Table 15.5 of G&C, because in 1972 
only two types of neutrinos were known, and we are now assuming three types of massless neutrinos, 
which increases the expansion rate at a given temperature. We are also using the modern value 1.257 
for the axial vector coupling constant of beta decay in Eq. (3.2.6), instead of the value 1.18 used in 
G&C. 
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Table 3.2: Neutron fraction Xn as a function of temperature or time (with neglect of the 
formation of complex nuclei). 


T (K) t(sec) Xn 

10!? 0.0001 0.4962 

3x 10!! 0.0011 0.4875 
10!! 0.0099 0.4626 

3 x 1010 0.1106 0.3798 
1010 1.008 0.2386 

3x 10° 12.67 0.1654 
1.3 x 10° 91.09 0.1458 
1.2 x 10? 110.2 0.1425 
1.1 x 10° 135.1 0.1385 
10° 168.1 0.1333 

9 x 108 212.7 0.1268 
8 x 108 274.3 0.1182 
7 x 108 362.6 0.1070 
6 x 108 496.3 0.0919 
3 x 108 1980 0.0172 

108 17780 3.07 x 10719 


then the chemical potential of nuclei of type i is 
Hi = Zip + (Aj — Zi) en. (3.2.16) 


We can eliminate the unknown nucleon chemical potentials by forming the 
quantities 


Ni _ 8i 3/2 (2amnkpT NOs gE Bi/kgT 3.2.17 
Zi Ai-Z; Ai i h2 E ? ( ar ) 
Np Nn 
where B; is the binding energy, defined by 
mj = ZiMp + (A; — Z;)m, — Bi. (3.2.18) 
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(In deriving Eq. (3.2.17), we neglect the binding energy and the neutron— 
proton mass difference outside the exponential.) Eq. (3.2.17) can be 
expressed in terms of the dimensionless ratios 


Xi = ni/ny , Xp = Np/NN , Xn =Mn/NN , (3.2.19) 
where ny is here the number density of all nucleons. In these terms, 


Zi GZ; vA;—Z; 43/2_A;—1_B; 
X;= 5 Xp Xa Ziga eli leBi/kBT (3.2.20) 


where € is the dimensionless quantity 


1 
e= "Nh QxmykgT)* 
-3 —3/2 
= -11 a T 2 


in which we have used ny = 3Q BHe (ao/ a)? /82 Gmy for the number density 
of nucleons, bound or free. During the period of interest (after electron— 
positron annihilation) the temperature T goes as 1/a, so Eq. (3.2.21) may 
be written 


T T +3/2 5 
e = 1.46 x 10 (mx) gh“. (3.2.22) 
The coefficient € is very small for temperatures in the range of interest, so in 
equilibrium a nuclear species i is nearly absent until the temperature drops 
to the value 
fee 

= (Ages Ty lel 
For Rgh? ~ 0.02 this temperature is 0.75 10° K for deuterium, 1.4x10° K 
for H?, 1.3x 10° K for He’, and 3.1 x 10° K for Het, with only a very weak 
dependence on Qgh’. The binding energy per nucleon for heavier nuclei is 
similar to that of He*, so they have similar values of 77. 

If thermal and chemical equilibrium were really maintained during the 
time that the temperature drops from around 10!° K to below 10° K, then 
during this time He* and heavier nuclei would appear first, followed by 
He? and H? (which would later beta decay to He), followed finally by 
H?. But this is not what happens. The density at this time is too low for 
any but two-body reactions to compete with the expansion rate, so nuclei 
are built up by a chain of two-body processes: first p +n > d+y, 
then d +d —> H? +p and d +d —> He? +n, and next d + H? > 
Hef + n and d + He? > Hef + p, as well as slower processes involving 
photons. 


(3.2.23) 
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There is no trouble with the first step. The rate of deuterium production 
per free neutron is 


-3 
Ag = 4.55 x 107cm? /sec x np = 511 sec™! ( —_— Xp gh? 
10-20 
T 3 
= 2.52 x 10f sec7! (sax) X, ghk . (3.2.24) 
Multiplying by the time (3.1.27), this gives 
rat ~45 x 104 (sanz) Xp Rgh? > (3.2.25) 


which remains substantially greater than unity until well after the temperature 
drops below 10° K. Therefore the deuterium abundance during the period 
of interest is given to a good approximation by its equilibrium value, which 
according to Eq. (3.2.20) is 


Xa = 3V2XpXn€ exp (2) ' (3.2.26) 
kgT 
The trouble is that, because of the small binding energy of the deuteron, 
the temperature Ty ~ 0.7 x 10? K is quite small, so deuterons remained 
rare until long after He* would have been abundant in thermal equilibrium. 
With deuterons rare, the two-deuteron processes d + d — H? + p and 
d+d — He? +n had small rates per deuteron, blocking further nucleosyn- 
thesis. (The rarity of deuterons had no effect on the rate per deuteron of 
radiative processes like p + d > He? + y and n + d — H? + y, and these 
reactions are included in modern nucleosynthesis calculations, but they have 
intrinsically small cross sections.) When finally the temperature dropped 
below T4 the neutrons that were still extant were very rapidly assembled 
into the most deeply bound of the light elements, Het. Further cosmo- 
logical nucleosynthesis was blocked by the non-existence of stable nuclear 
species with atomic weight 5 or 8. (In stars this blockage is overcome? by the 
brief formation of the unstable Be® nucleus in collisions of two He* nuclei, 
followed by resonant capture of another He’ nucleus to form an excited state 
of C!?, but the time available in the early universe was too short for this to 
have been effective then.) Thus, to a good approximation, the fraction by 
weight Y of He* formed in the early universe is just equal to twice? the 
fraction X,, of all nucleons that are neutrons (because in He* each neutron 


4E. E. Salpeter, Astrophys. J. 115, 326 (1952). 
SIn a little more detail, with nye helium nuclei and ny hydrogen nuclei per unit proper volume, the 
helium mass per unit proper volume in atomic mass units is 47,42, while the total mass per unit proper 
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is accompanied with a bound proton) at the time that deuterium becomes 
abundant enough to allow the build-up of heavier nuclei. 

This actually occurs at a temperature somewhat above Tg. The first steps 
in building up heavier elements from deuterium are the reactions d + d > 
H? +p and d +d — He? +n. These are exothermic processes, so for 
initial velocity v — 0 these have cross sections proportional to 1/v. At the 
temperatures of interest here, we have 


(d+ > H? +p)v) ~ 1.8 x 10717 cm?/sec , 


(o(d +d— He? + n)v) ~ 1.6 x 107!’ cm?/sec , 


so the total rate of these processes per deuteron is® 


A= [o(a+a > H? +p)v) + (o(d +d > He? +n)v)] Xin 


~ 1.9 x 107 (7/10"°K) (227?) Xa sec”! 


This may be compared with the expansion rate, which after electron—positron 
annihilation is given by Eq. (3.1.27) as H = 1/2t = 0.28(T/10!°K)? sec7!, 
so for T in the neighborhood of 10° K, we have A = H at Xy ~ 1.2 x 
10-7/QzBh?, which is 0.6 x 1075 for Rgh? = 0.02. This value is reached in 
thermal equilibrium at a temperature ~ 10? K, with only a weak dependence 
on Qgh’, so nucleosynthesis began at around 10° K, not at 0.75 x 10? K. 
According to table 3.2, this happened when t = 168 seconds, so according 
to Eq. (3.2.14), the abundance by weight of helium formed at this time was 
about 


Y, ~ 2 x 0.1609 x exp(—168/885) ~ 0.27. 


(The conventional subscript p stands for “primordial.”) The larger the 
nucleon density, the higher the temperature at which nucleosynthesis began, 
and so the less time there was for neutron decay before nucleosynthesis, 
leading to a higher final He* abundance. The results of modern calculations 
are usually given for different values of 7, the ratio of nucleons to photons, 
which according to Eqs. (2.1.12) and (2.1.13) is related to gh? by 


Rgh? = 3.65n x 10’, 


volume is 4nHe + ny, so the fractional abundance by weight of helium is Y = 4nye/(4nHe + ny). But 
the numbers of protons and neutrons per unit proper volume are np = 2nye + ny and ny = 2npe, SO 
the fraction of nucleons that are neutrons is Xn = 2nye/(4npye + my). Hence Y = 2Xn. 

6y, Mukhanoy, ref. 1. 
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so for instance Qgh* = 0.02 corresponds to n = 5.5 x 107!9. (The 
nucleon density at a given temperature can be expressed in terms of 7 
without knowing the present microwave background temperature, which 
made n a more convenient parameter than Qg/7 before the 1990s, when the 
present microwave background temperature was not yet accurately known.) 
Detailed calculations’ give Y, = 0.232 for n = 2 x 107!° and Y, = 0.240 
for n =4 x 10719, 

The primordial helium/hydrogen ratio is inferred from spectroscopic 
study of HII regions (regions of ionized hydrogen) containing low 
abundances of “metals” (elements other than hydrogen or helium), espe- 
cially in blue compact galaxies which have not yet formed many stars. Obser- 
vations in the 1990s were divided between those giving lower values® Y, = 
0.234+0.002+0.005, and higher values” Y, = 0.243+0.003. Even with this 
division among the observers, the helium abundance clearly called for a hot 
universe, with 7 in the range of 1071? to 5x 107!°. More recent observations 
of HII regions, combined with new atomic data used to interpret the spec- 
troscopic observations, have led to a more precise determination: 1° Y, = 
0.2477 + 0.0029, corresponding to n = (5.813 + 1.81) x 107!°. The uncer- 
tainty in the value of 7 given by observations of helium abundance is still 
quite large, because Yp is only weakly dependent on the baryon/photon 
ratio. 

Helium abundance is more useful as a test of the expansion rate than 
of the value ofn. For instance, if there were four flavors of massless neutrinos 
that went out of equilibrium at temperatures between 10!! K and 10!° K, 
then at temperatures below 3 x 10? K, the effective number of particle species 
would have been 2 + (7/8) - 8 - (4/11)4/3 = 3.817 instead of 3.363, so the 
time required to drop from 3 x 10? K to any lower temperature would have 
been shortened by a factor ./3.363/3.817 = 0.94. Shortening the time to 
reach a given temperature increases the neutron fraction at that temperature, 
and hence increases the abundance of helium produced at the temperature 
T ~ 10°K of nucleosynthesis. It is actually the shortening of the time 
scale at temperatures between 3 x 10!9 K and 3 x 10? K, when electron- 
positron pairs were disappearing, that would have the largest effect on the 
helium abundance. Detailed calculations show!! that for each additional 


TR. E. Lopez and M. S. Turner, Phys. Rev. D 59, 103502 (1999) [astro-ph/9807279]. 

8K. A. Olive and G. Steigman, Astrophys. J. Suppl. 97, 49 (1995); K. A. Olive, E. Skillman, and 
G. Steigman, Astrophys. J. 483, 788 (1997). 

°Y. I. Izotov, T. X. Thuan, and V. A. Lipovetsky, Astrophys. J. 435, 647 (1994); Astrophys. J. Suppl. 
108, 1 (1997); Y. I. Izotov and T. X. Thuan, Astrophys. J. 497, 227 (1998); 500, 188 (1998). 

10A, Peimbert, M. Peimbert, and V. Luridiana, Astrophys. J. 565, 668 (2002); V. Luridiana, 
A. Peimbert, M. Peimbert, and M. Cerviño, Astrophys. J. 592, 846 (2003); M. Peimbert, 
V. Luridiana, and A. Peimbert, Astrophys. J. 667, (2007) [astro-ph/0701580]. 

lR. E. Lopez and M. S. Turner, ref. 7. 
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neutrino species this effect increases Y by an amount 0.01276 for n = 107!°, 
and by 0.01369 for n = 5 x 107!°. The agreement between theory and 
observation for the helium abundance gave an upper bound of four light 
neutrino flavors before this was accurately measured to be just three flavors in 
measurements of the Z° decay width, part of which is due to the unobserved 
processes Z? — v+7. (Strictly speaking, the Z? width measures the number 
of neutrinos with masses less than mz/2 = 45.6 GeV, while the helium 
abundance measures the number of neutrinos with masses less than about 
1 MeV.) Although the number of light neutrino flavors is now definitely 
known on the basis of Z? decay to be no greater than three, there may be 
other light particles left over from the very early universe that contribute to 
the expansion rate, and for these the helium abundance continues to provide 
useful upper bounds. 

The nuclear reactions that built up helium from free neutrons at T ~ 
10° K were not perfectly efficient, but left over a small residue,!? like an 
unburned ash, of the light elements H*, H?, He?, Li’, and Be’. The nuclei 
of H? decayed later by B+ decay to He?, and the nuclei of Be’ decayed 
later by electron capture to Li’, leaving us with H?, He*, and Li’, as well as 
protons and Het. The calculated abundances are shown in Figure 3.1. 

The higher the baryon density, the more complete will be the 
incorporation of neutrons into Het, and hence the smaller the resulting 
abundance of deuterium. We saw earlier that nucleosynthesis began when 
Xa ~ 1.2 x 1077/Qzgh?, or Xa ~ 0.6 x 1075 for Rgh? = 0.02. The deu- 
terium fraction continued to rise for a while, as the temperature dropped 
and the exponential in Eq. (3.2.26) increased, but X4 then decreased again 
as the incorporation of free neutrons into deuterium reduced the factor Xn 
in Eq. (3.2.26), and deuterium was converted to H? and He’, and thence to 
Het. The final results, shown in Figure 3.1, are not very different from the 
deuterium fraction Xy ~ 1.2 x 1077/Qgh? that we found at T = 10° K, 
and in particular exhibit a strong decline with increasing values of Qh’. 

The measurement of the deuterium abundance is complicated by the fact 
that deuterium has a small binding energy, and can readily be destroyed in 
stars. Any measurement of the deuterium abundance therefore gives a lower 
bound on the primordial deuterium abundance, and hence an upper bound 
on the baryon density. 

In the past, the deuterium/hydrogen ratio was measured in various ways: 


1. Interstellar medium. Spectroscopic studies of the interstellar medium 
gave a deuterium/hydrogen ratio! of (1.60 + 0.097)?) x 1075. 


12The calculated abundances cited below are given by S. Burles, K. M. Nollett, and M. S. Turner, 
Astrophys. J. 552, L1 (2001). 
By L, Linsky et al., Astrophys. J. 402, 694 (1993); 451, 335 (1995). 
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Figure 3.1: Calculated primordial abundances of deuterium, He?, and Li’ relative to hydro- 
gen, and the fraction Yp of the primordial mass of the universe in He’, as functions of 
mo = 10!%p, where n is the ratio of nucleons to photons in the present universe. The 
widths of the bands indicate the effect of uncertainties in nuclear reaction rates. From 
G. Steigman, Int. J. Mod. Phys. E15, 1 (2006) [astro-ph/0511534]. 


2. Solar wind. Deuterium was converted to He? in the sun before it went 
onto the main sequence, so measurements of He? in the solar wind and 
meteorites is believed to give the total pre-solar value of He? and deu- 
terium. Subtracting the abundance of He? obtained in other ways gave!4 a 
deuterium/hydrogen ratio of (2.6 + 0.6 + 1.4) x 107°. 


3. Jovian atmosphere. Spectroscopic studies of the Jovian atmosphere gave! 
a deuterium/hydrogen ratio of (5 +2) x 1075. 


5 


More recently, it has been possible to measure the deuterium/hydrogen 
ratio in very early intergalactic matter, by observing deuterium as well 
as hydrogen absorption lines in the spectra of quasistellar objects, due to 


14K _ A. Olive, lectures given at the Advanced School on Cosmology and Particle Physics, Peniscola, 
Spain, June 1998, and Theoretical and Observational Cosmology Summer School, Cargese, France, 
August 1998 [astro-ph/9901231], and references therein. 

15H, B. Niemann et al. Science 272, 846 (1996). 
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Table 3.3: Five quasi-stellar objects in whose spectra deuterium and hydrogen absorption 
lines are observed, together with the redshift of the intergalactic cloud responsible for the 
absorption, and the ratio of the numbers of deuterium and hydrogen atoms in the clouds 
inferred from the relative strength of the absorption lines, from ref. 17. (Observational 
uncertainties represent one standard deviation.) 


QSO z D/H 

PKS 1937-1009 3.572 3.25 + 0.3 x 1075 
Q1009+299 2.504 39875 x 1075 
HS 0105+1619 2.536 2.54 + 0.23 x 1075 
Q1243+3047 2.525675 2.424933 x 1075 
Q2206-199 2.0762 1.65 + 0.35 x 1075 


absorption in intervening intergalactic clouds of large redshift.!° In 2003 the 
results of several years of observations of deuterium and hydrogen absorp- 
tion lines in the spectra of four quasi-stellar objects were put together with 
results for one more QSO, !7 with the results shown in Table 3.3. The best 
value of the deuterium/hydrogen number ratio was found to be 218 x 
1075, from which is inferred a baryon/photon ratio n = 5.9 + 0.5 x 10719, 
corresponding to 


Qgh? = 0.0214 + 0.0020. 


Even for Ho as small as 50 km/sec/Mpe, it was clear that Qg is much less 
than the fraction that all non-relativistic matter contributes to the critical 
density, which we have seen had been given as Qm ~ 0.2 by studies of 
galaxy clusters and as Qy ~ 0.3 by the redshift—distance relation of type 
Ia supernovae. It is this discrepancy that provided the original evidence for 
non-baryonic dark matter in the universe. 

The discrepancy has become sharper through measurements of 
anisotropies in the cosmic microwave background. As we will see in Sec- 
tion 7.2, from these measurements it is possible to infer that Qgh? = 
0.02234 9-007 and Qah? = 0.127+9207, indicating a total mass density that 
is from 5 to 6 times larger than the density of ordinary baryonic matter. The 
nature of the missing matter is discussed in Section 3.4. 


16For a summary of early results, see K. A. Olive, ref. 13; S. Sarker, talk at the Second International 
Workshop on Dark Matter in Astro- and Particle Physics, Heidelberg, July 1998 [astro-ph/9903183]. 

11D, Kirkman, D. Tytler, N. Suzuki, J. M. O’Meara, and D. Lubin, Astrophys. J. Suppl. 149, 1 (2003) 
[astro-ph/0302006], and references cited therein. 
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The primordial abundance of He?, like that of deuterium, is a 
monotonically decreasing function of the baryon/photon ratio. On the 
basis of a long term study of galactic HII regions and planetary nebula, 
a 2002 study!® concluded that the He?/H ratio in interstellar space is less 
than 1.1 + 0.2 x 1075, which is consistent with the value 1.04 + 0.06 x 
107° calculated for Rgh? = 0.0214. But, unlike deuterium, He? is both 
produced and destroyed in stars, so it is not clear whether the observed 
interstellar abundance really represents the primordial abundance of He?. 
Indeed, it had been thought that low-mass stars would inject a good deal 
of He? into the interstellar medium, in which case the apparent agree- 
ment between the amount observed in the interstellar medium and the 
amount expected from cosmological nucleosynthesis would actually repre- 
sent a discrepancy.!? This apparent discrepancy may have been removed 
by detailed calculations?” of the movement of He? into the interior of 
these low-mass stars, which indicate that these stars do not in fact emit 
much He’. 

Some Li® is produced cosmologically, but in such small quantities (an 
abundance of about 107!3 to 10714 that of hydrogen) that it has generally 
not been considered useful as a test of cosmological theories.2! Much more 
attention has been given to Li’. Its abundance has a more complicated 
dependence on the baryon/photon ratio, because Li’ was formed in two 
different ways: directly, by the reactions H? + He* > Li’ + y, and indi- 
rectly by He? +He* —> Be’ +y, followed much later by e~ + Be? > v+ Li’. 
As we go to higher baryon densities, the amount of Li’ produced directly 
increases at first, but then begins to decrease as Li’ is destroyed in the 
reaction p + Li’ > Hef + Het. Eventually the indirect reaction takes over, 
and the Li’ abundance rises again. The minimum Li’/hydrogen ratio is 
calculated to be about 2 x 107!°, and is reached at a baryon/photon ratio 
of about 3 x 107!9, corresponding to Q gh? = 0.01. The observed Li’ /H 


ratio was reported? in 2000 to be Oe, x 107!9, close to this mini- 


mum. A subsequent study?’ of 63 dwarf stars in the galactic halo gave a 
Li’/H ratio of (2.37 + 0.05) x 107!°. Either result is less than the value 
3 x 107! predicted for Qgh* = 0.0214, but it is plausible that although 


18T. M. Bania, R. T. Rood, and D. S. Balser, Nature 415, 54 (2002). 

19N. Hata et al., Phys. Rev. Lett. 75, 3977 (1995); K. A. Olive et al., Astrophys. J. 444, 680 
(1995). 

20P P. Eggleston, D. S. P. Dearborn, and J. C. Lattanzio, Science 314, 1580 (2006). 

2l For a review, see E. Vangioni-Flam, M. Cassé, R. Cayrel J. Audouze, M. Spite, and F. Spite, New 
Astron. 4, 245 (1999). 

22T, K. Suzuki, Y. Yoshii, and T. C. Beers, Astrophys. J. 540, 99 (2000) [astro-ph/0003164]. For earlier 
observations, see M. Spite and F. Spite, Nature 297, 483 (1982); S. G. Ryan, J. E. Norris, and T. C. Beers, 
Astrophys. J. 523, 654 (1999). 

23J, Melendez and I. Ramirez, Astrophys. J. 615, L33 (2004). 
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Li’ is produced in stars and in the interaction of cosmic rays with matter, 
the Li’ abundance has been depleted by convection in stellar atmospheres. 
Observation of Li’ abundances in stars of varying temperature in the glob- 
ular cluster NGC 6397 gave results in agreement with a theory of convective 
depletion of Li’, with an assumed initial Li’/H ratio equal to the expected 
value 3 x 107! (calculated taking Qgh? to have the value estimated from 
deuterium abundance and microwave background anisotropies).7+ With 
the one possible exception of Li’, there is now complete agreement between 
observations of light element abundances and calculations of cosmologi- 
cal nucleosynthesis, adopting the value Qgh? provided by observations of 
anisotropies in the cosmic microwave background. 

Although the baryon mass density inferred from cosmological 
nucleosynthesis and the cosmic microwave background is considerably less 
than the total mass density, it is also considerably greater than the density of 
baryonic matter observed in stars and luminous interstellar matter.” Some 
of this dark baryonic matter is in intergalactic space, but a fair fraction is 
believed to be present in galaxies, in the form of brown dwarf stars and 
clouds of hydrogen molecules.*° But this is more a problem for the astro- 
physics of galaxies than for cosmology. 


3.3 Baryonsynthesis and leptonsynthesis 


We saw in the previous section that the ratio 7 of nucleons to photons at 
the time of nucleosynthesis had the tiny value ~ 5 x 107!. At earlier 
times, when the temperature was above 10!* K, nucleons would not yet 
have formed from their three constituent quarks, and there would have 
been roughly as many quark—antiquark pairs in thermal equilibrium as 
photons. But the conservation of baryon number (one-third the number 
of quarks minus the number of antiquarks) during the annihilation process 
tells us that before annihilation there must have been a slight excess, roughly 
of order n per photon, of quarks over antiquarks, so that some quarks 
would survive to form nucleons when all the antiquarks had annihilated 
with quarks. There was also a slight excess of electrons over positrons, to 
maintain the charge neutrality of the universe. It is conceivable that there 
is a compensating excess of antineutrinos over neutrinos, so that the total 
lepton number density (the number density of electrons, muons, tauons, and 
neutrinos, minus the number density of their antiparticles) vanishes, but it 
seems more natural to assume that before lepton—antilepton annihilation 


24A. J. Korn et al., Nature 442, 657 (2006) [astro-ph/0608201]. 
25M. Fukugita, C. J. Hogan, and P. J. E. Peebles, Astrophys. J. 503, 518 (1998). 
26See e.g. M. Roncadelli, Recent Research Devel. Astron. & Astrophys. 1, 407 (2003). 
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there was also a slight imbalance of leptons and antileptons, comparable to 
the excess of quarks over antiquarks. 

These tiny imbalances in the numbers of quarks and antiquarks and 
of leptons and antileptons might be explained if the baryon and lepton 
number densities were generated by physical processes in a universe that at 
some early time had equal number of particles and antiparticles of all sorts. 
We could then hope to calculate n from first principles, and understand why 
it is so small. In 1967 Sakharov! outlined three conditions that must be met 
for this to be possible: 


1. Obviously, in order for an excess of baryons over antibaryons or an 
inequality of leptons and antileptons to arise in a universe that begins 
with equal numbers of particles and antiparticles of each type, some phys- 
ical processes must violate the conservation of baryon number or lepton 
number. 


2. A universe with equal numbers of particles and antiparticles of each 
type (and each momentum and helicity) is invariant under the symmetry 
operators C (the exchange of particles with antiparticles) and CP (the 
exchange of particles with antiparticles, combined with a change of sign 
of all three-dimensional coordinate vectors), while a state with an excess 
of baryons over antibaryons or an imbalance of leptons and antileptons 
is clearly not invariant under either C or CP. Hence to produce such an 
state, some physical process must violate invariance under both C and CP. 
It is true that, whether or not C and/or CP are conserved, any relativistic 
quantum field theory will respect a symmetry” under CPT, the simultaneous 
exchange of particle with antiparticles, combined with a change of sign of 
all three-dimensional coordinate vectors, combined with a change in the 
direction of time’s flow, but this does not prevent the production of baryon 
or lepton number, because the time-reversal symmetry T is violated by the 
expansion of the universe. 


3. A little less obviously, in order to produce an excess of baryons over 
antibaryons or an imbalance of leptons and antileptons out of a state 
with equal numbers of particles and antiparticles, the universe must at 
some time depart from a state of thermal (including chemical) equilibrium. 
This is because in a state of thermal equilibrium, if baryon and/or lepton 
conservation are not respected, and all conserved quantities like electric 
charge vanish, then since chemical potentials must be conserved in all reac- 
tions, all chemical potentials must vanish. The CPT symmetry implies 
that even if C and CP are not conserved, the masses of particles and their 


1A. D. Sakharov, JETP Lett. 5, 24 (1967). 
2See QTF, Vol. I, Sec. 5.8. 
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antiparticles are precisely equal, so with vanishing chemical potentials par- 
ticles and antiparticles will have identical distribution functions, such as 
(3.1.28) with u = 0 for ideal gases. Thus, whatever the rates for various 
processes, no net baryon or lepton number will be produced.* An exception 
to this reasoning is presented at the end of this section. 


All three conditions are now known to be satisfied: 


(1) There is no direct experimental evidence for the nonconservation of 
baryon number, but a very weak baryon number nonconservation is expected 
according to modern views of the standard model of elementary parti- 
cles. According to these views, the standard model is not a fundamental 
theory, which might be expected to be contain only interactions whose 
coupling parameters are either dimensionless or proportional to positive 
powers of mass, so that all infinities that arise in the standard model can 
be absorbed into a renormalization of these coupling parameters. Rather, 
we now think that the standard model is only an effective field theory, 
valid at energies much less than some fundamental mass scale M, which 
might be the Planck mass Mp = G7! = 1.22 x 10!° GeV, or perhaps 
the energy scale ~ 10!> to 10!6 GeV at which the three independent (suit- 
ably normalized) gauge coupling parameters of the standard model become 
equal. We would expect such an effective field theory to contain every pos- 
sible interaction allowed by the gauge symmetries of the strong, weak, and 
electromagnetic interactions, but all but a finite number of these coupling 
parameters will have the dimensions of negative powers of mass. These 
“non-renormalizable” couplings are thus suppressed at energy E « M by 
powers of E/M. Now, the gauge symmetries of the standard model do 
not allow any unsuppressed interactions among quarks and leptons that 
violate baryon or lepton number, so this picture makes it plausible that 
baryon and lepton number would be automatically conserved to a good 
approximation for energies E < M, even if baryon and lepton conserva- 
tion are not respected by whatever fundamental theory describes physics 
at energies of order M. In other words, baryon and lepton number con- 
servation may be mere “accidental” symmetries. In this case, there is no 
reason to exclude any suppressed interactions that violate conservation of 
baryon and lepton number. The least suppressed interactions of this sort 
are an interaction involving two lepton doublets and two scalar doublets, 
which is suppressed by a factor M~! and violates lepton but not baryon con- 
servation, and an interaction suppressed by a factor M~? involving three 
quark fields and one lepton field, which violates both lepton and baryon 


3For a more detailed argument and references to earlier discussions, see S. Weinberg, Phys. Rev. 
Lett. 42, 850 (1978). 


175 


3 The Early Universe 


conservation.* (Also, as we shall see below, there is a quantum-mechanical 
violation of baryon-number and lepton-number conservation in the stan- 
dard electroweak theory.) There is experimental evidence for the first but not 
the second interaction. With a coupling parameter of order 1/(10!° GeV), 
when the neutral scalar fields are replaced by their vacuum expectation val- 
ues, this interaction provides a neutrino mass of order 107? eV, in good 
agreement with the results of neutrino oscillation experiments. (The exis- 
tence of non-zero neutrino masses means that helicity +1 /2 neutrinos, which 
are conventionally assigned lepton number +1, can be changed to helicity 
—1/2 neutrinos with lepton number —1 by a Lorentz transformation.) The 
second interaction would lead to decay processes like p > z? + e+, with 
decay rates of order mi /M*h. Such events have not been seen, but could 
easily have escaped detection. 


(2) The violation of invariance under C was discovered in 1957, while the 
violation of invariance under CP was discovered in 1964.5 


(3) The expansion of the universe tends to pull states out of thermal equilib- 
rium, either because the cooling temperature makes reaction rates decrease 
below the expansion rate, or because as it cools the universe goes through 
first-order phase transitions, similar to the condensation of water vapor 
or the freezing of liquid water. Although this in itself does not violate 
the conservation of baryon or lepton number, it opens the door for phys- 
ical processes that do violate these conservation laws, as well as C and 
CP conservation, to create an imbalance between baryons and antibaryons 
and/or between leptons and antileptons. 


All this just goes to show that it is possible for physical processes to 
produce a non-zero cosmological baryon and lepton number. It remains 
to find a specific theory in which the observed ratio of baryons to photons 
could be produced. There are several theories of this type: 


1. Delayed decay of heavy particles’ 

Suppose there is a species of heavy “X” particle, which decays into a pair 
of different channels, with baryon numbers Bı and B and lepton numbers 
Lı and Ly, and branching ratios r and | — r. (For instance, in some grand 


4For a review, with references to the original literature, see QTF, Vol. II, Section 21.5. 

>See QTF, Vol. I, Section 3.3. 

6For the earliest attempts in this direction, see M. Yoshimura, Phys. Rev. Lett. 41, 281 (1978); 42, 
746(E) (1979); S. Dimopoulos and L. Susskind, Phys. Rev. D 18, 4500 (1979); Phys. Lett. 81B, 416 
(1979); A. Yu. Ignatiev, N. V. Krosnikov, V. A. Kuzmin, and A. N. Tavkhelidze, Phys. Lett. 76B, 436 
(1978); B. Toussaint, S. B. Treiman, F. Wilczek, and A. Zee, Phys. Rev. D 19, 1036 (1978); J. Ellis, M. K. 
Gaillard, and D. V. Nanopoulos, Phys. Lett. 80B, 360 (1979); 82B, 464(E) (1979). 

7S. Weinberg, Phys. Rev. Lett. 42, 850 (1978). 
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unified theories there are “leptoquarks” with mass of order 10!> GeV, that 
decay into either two quarks, with baryon number 2/3 and lepton number 
zero, or into a lepton and antiquark, with baryon number —1/3 and lepton 
number +1.) The antiparticle will then decay into channels with baryon 
numbers —B, and —B> and lepton numbers —L; and —L», with branch- 
ing ratios 7 and 1 — 7. (Invariance under CPT tells us that any particle 
has the same total decay rate as its antiparticle, but as long as C and CP 
conservation are violated, it is possible for particles and antiparticles to have 
different branching ratios for their different decay channels.) The average 
total baryon number produced in the decay of one X particle and the decay 
of one of the corresponding antiparticles is then 


AB=rBi+(1—1r)Bo — rBı — (1 — r)B2 = (r — r)(Bı — B2), (3.3.1) 


and likewise for lepton number. 

Similarly, there may be some heavy “N” particle that is its own 
antiparticle, and that decays with branching ratio r into one channel with 
baryon number Bı and lepton number Lı, and with branching ratio 1 — r 
into the antichannel with baryon number — B; and lepton number — Lı. On 
the average, each decay produces a baryon number 


AB =rBı — (1 —r)B1, (3.3.2) 


and likewise for lepton number. If C and CP conservation are violated, 
then it is possible to have r # 1 — r, so that a net baryon number and lepton 
number may be produced. For instance, in some grand unified theories 
there are neutral fermions that decay both into scalar particles and leptons 
and into their antiparticles, producing a net lepton number if the branching 
ratios for these channels are unequal. 

Such processes do not produce any net baryon or lepton number in 
equilibrium, because the inverse to the decay processes will destroy pre- 
cisely as much baryon and lepton number as the decay processes create. 
The conditions for thermal equilibrium will be violated if H falls below 
the X-particle decay rate, but as long as kgT remained above all particle 
masses, whatever the rates of various processes, the expansion preserved 
the equilibrium form for all particle distributions, with a redshifted temper- 
ature T œ 1/a. Specifically, if kgT was still above all particle masses when 
H = 4/a fell below the decay rate Ty, the inverse decay process would 
re-create as many X particles as had decayed. On the other hand, if at 
the time that H ~ Ty, kgT was less than the mass my of the X particles, 
then the Boltzmann factor e~”"*/8T would have blocked the inverse decay, 
and the X particles and antiparticles would have disappeared, yielding a net 
baryon number (r—7)(B, — B2) for each original heavy particle—antiparticle 
pair. 
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The condition for this to work is then that kgT < my at the temperature 
when H =T y. We can estimate that (here taking i = c = 1) 


4 

ie {oe NA = 1.66(kgT)? N! /mp (3.3.3) 
where mp = GT! = 1.22 x 10!9 GeV, and M is the total number of 
helicity states of all elementary particles and antiparticles, with an extra 
factor 7/8 for fermions. (We assume here that the main contribution to 
the energy density of the universe at these temperatures comes from the 
large number M of highly relativistic particle types, rather than from the 
X particles themselves.) The decay rate of the X particle will be my times 
some dimensionless parameter wy, which characterizes the strength of the 
interactions responsible for the decay and the number of decay channels. 
Hence decays start to be significant at a temperature Ty, given by 


kpTy © aymymp/N'/2 (3.3.4) 


For this to be less than my, we must have 
my > N`! aymp . (3.3.5) 


This is a fairly severe lower bound on my. For instance, if the X particles 
decay through ordinary electroweak interactions, then ay ~ 1077, so if 
N ~ 100 we must have my greater than about 10!° GeV. From the point of 
view of theories that unify the strong and electroweak interactions, this is 
not an unreasonable value for the mass. 

Assuming that this condition is satisfied, we can easily use Eq. (3.3.1) to 
make an estimate of n. At temperatures far above the heavy particle mass 
the number density of pairs of X particles and antiparticles is of the same 
order as the number density of photons. The entropy density (using energy 
units for temperature, with kg = 1) at this time is of the order of M times the 
number density of photons, so the ratio of the number densities of X and Y 
pairs to the entropy density is of the order of 1/M. With Eq. (3.3.4) satisfied, 
after the disappearance of these pairs the ratio of the baryon number to the 
entropy densities will be of order (r — 7)(B, — Bz2)/N. The entropy density 
varies as a~>, and provided that baryon number is subsequently conserved so 
does the baryon number density, and so the baryon number to entropy ratio 
will remain unchanged. The present entropy density is of the order of the 
photon number density, and at present the only baryons are nucleons, with 
no antinucleons, so we expect a nucleon to photon ratio 


n ~ (r —F)(By — Bo)/N . (3.3.6) 
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Typically Bı — B2 is of order unity (in the leptoquark example mentioned 
above, it is equal to unity) but r —7 is generally very small, both because CP 
conservation is weakly violated, and because the CPT theorem tells us that 
r =F in the lowest order of perturbation theory. The precise value of r — 7 
is very model-dependent, but values of 7 of the desired order, 5 x 107!°, 
appear quite natural.® This idea runs into difficulty in inflationary theories, 
which as we will see in Chapter 10 generally require that the temperature was 
never high enough to produce particles with masses satisfying Eq. (3.3.5). 


2. Nonperturbative electroweak baryon and lepton 

number nonconservation 

In the standard model of weak, electromagnetic, and strong interactions 
baryon and lepton number are automatically conserved to all orders of per- 
turbation theory, but not when certain non-perturbative effects are taken 
into account.? This produces reactions that violate baryon and lepton 
number conservation, but such reactions are suppressed at low tempera- 
tures by a factor exp(—827/g) ~ 107162, where g is the SU (2) electroweak 
coupling constant. This tiny exponential is actually a barrier penetration 
factor, which accompanies the quantum mechanical tunneling transition 
through the barrier between topologically different configurations of the 
gauge fields. At high temperatures, above about 300 GeV, thermal fluc- 
tuations allow passage over this barrier, and the exponential suppression 
disappears.!° Nevertheless, by themselves these reactions do not produce 
an appreciable net baryon or lepton number, both because they take place 
at a time of nearly perfect thermal equilibrium,!! and because they are 
suppressed by small parameters associated with the need to violate CP con- 
servation as well as the conservation of baryon and lepton number.!? 


3. Leptogenesis!? 


Although the non-perturbative effects of electroweak interactions described 
in the previous paragraph do not by themselves provide a way of accounting 


8p. V. Nanopoulos and S. Weinberg,Phys. Rev. D 20, 2484 (1979). 
9G. t Hooft, Phys. Rev. Lett. 37, 8 (1976). Also see QTF, Vol. II, Section 23.5. 

10V, A. Kuzmin, V. A. Rubakov, and M. E. Shaposhnikov, Phys. Lett. 155B, 36 (1985). The transition 
between field configurations is dominated by intermediate field configurations known as sphalerons; see 
N. S. Manton, Phys. Rev. D 28, 2019 (1983); F. R. Klinkhammer and N. S. Manton, Phys. Rev. D 30, 
2212 (1984); R. F. Dashen, B. Hasslacher, and A. Neveu, Phys. Rev. D 10, 4138 (1974). 

' The absence of a first-order phase transition in the electroweak theory is shown by K. Kajantie, M. 
Laine, K. Rummukainen, and M. Shaposhnikov, Nucl. Phys. B 466, 189 (1996); K. Rummukainen, 
M. Tsypin, K. Kajantie, and M. Shaposhnikov, Nucl. Phys. B 532 (1998); F. Csikor, Z. Fodor, and 
J. Heitger, Phys. Rev. Lett. 82, 21 (1999); and earlier references cited therein. 

12M. E. Shaposhnikov, JETP Lett. 44, 465 (1986); Nucl. Phys. B 287, 757 (1987). 

BM. Fukugita and T. Yanagida, Phys. Lett. B 174, 45 (1986). For a review, see W. Buchmiiller, R. D. 
Peccei, and T. Yanagida, Ann. Rev. Nucl. Part. Sci. 55, 311 (2005) [hep-ph/0502169]. 
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for the observed baryon/photon ratio, they can convert a cosmological lep- 
ton number density into a baryon number density, or vice versa. The only 
truly conserved quantum numbers in the SU (3) x SU(2) x U(1) standard 
model are those associated with gauge symmetries — the electroweak isospin 
generator T3, the electroweak hypercharge Y (defined so that the electric 
charge in units of e is 73 — Y), and a pair of generators of the SU (3) gauge 
group of quantum chromodynamics — together with B—L, the total baryon 
number minus the total lepton number, which is conserved because of a can- 
celation between Feynman diagrams containing loops of quarks or leptons. 
This creates a further problem for the proposal that the observed baryon 
number density of the universe is created in the decay of a leptoquark into 
both two quarks and into a lepton and antiquark. Although these channels 
have different values for B and L, they both have the same value (equal to 
2/3) for B — L. Hence if the universe starts with equal numbers of particles 
and antiparticles of all types, then even if leptoquark decay produces equal 
non-zero baryon and lepton number densities, all truly conserved quanti- 
ties will remain zero, so when thermal equilibrium is established at lower 
temperatures it will be with zero values for all chemical potentials. (This 
is demonstrated below.) Such a state has equal numbers of particles and 
antiparticles, and hence zero densities of B and L as well as B — L. 

On the other hand, if some heavy particle (such as the N particle 
mentioned above) in the early universe decays in such a way as to pro- 
duce a non-zero density of B — L this will persist through the period of 
thermal equilibrium, though the relative densities of B and L may change. 
In general, whatever mixture of baryon and lepton number is produced 
when the heavy particle decay, and even if only lepton number is produced, we 
would expect the densities of baryon and lepton number to be comparable 
in a subsequent period of thermal equilibrium. 

This can be made quantitative.'4 Suppose in thermal equilibrium there 
are a set of conserved quantum numbers Q4, such as 73, Y, and B — L. 
Suppose also that there are several species i of particles in equilibrium, 
such as quarks, leptons, etc., each carrying a value qa; for the quantum 
number Qa. The chemical potentials u; for these particles must be conserved 
for all reactions in thermal equilibrium, which requires that they be linear 
combinations of the conserved quantum numbers: 


Hi = Y daha > (3.3.7) 
a 


with coefficients 4a that can be regarded as chemical potentials for the 
different conserved quantities. The densities of the different particle species 


14) A. Harvey and M. S. Turner, Phys. Rev. D 42, 3344 (1990). 
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can then be expressed as functions of the ua and the temperature, and these 
relations can be used to calculate the densities n4 of the different conserved 
quantum numbers as functions of the ua and the temperature. But there are 
just as many ng as there are ua, so these relations can be inverted to give the 
La in terms of the na and the temperature, from which we can calculate the 
density of anything else as functions of the na and the temperature, including 
the density of a nonconserved quantity like baryon or lepton number that 
is not among the Q4. 

In the case that interests us here, at temperatures above about 10!° K, all 
particles of the Standard Model are highly relativistic, so that their masses 
can be neglected. The number density of particle species i is then 


ee Si dp 
O (Arh J e-vad/kaT = | 


kpT 3 pœ x? dx 


where g; is the number of helicity (and other sources of multiplicity) states 
for each species, and the + sign is — for bosons and + for fermions. The 
antiparticle density 7; will be given by the same formula, but with u; replaced 
with — u;i, so the difference is 


ni —n; = 8x (ET) sion ( a Me x? e” dx 
DOPE ENIR kgT J Jo e2* F2e* cosh(uj/kgT) +1 — 
(3.3.9) 


In the situation that concerns us here, the imbalance between particles and 
antiparticles of all sorts is small, so |u;| « 1 for all particle species. In this 


case, 
_ kpT 3 li œ x2 e* dx 
;— nj = j : .3.1 
EN Brg: (57) =a (ex = 1)? (Aao 


The integral over x has the value 17/3 for bosons and x?/6 for fermions, 
so we can write this as 


ni — ni = f (T) Zi hi ,ọ (3.3.11) 
where BONT 
TENG 
TUIS Onh? ’ (3.3.12) 


and g; is the number of spin states, but with an extra factor of 2 for bosons. 
Using Eq. (3.3.7), Eq. (3.3.11) becomes 


ni — i = f(T) i daikta» (3.3.13) 
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In particular, the density of the conserved quantum number Q4 is 
na = Ydai(mi — Fi) =f(T) S Maw to » (3.3.14) 
i b 


where M is the matrix 


Map = Y Ži qai qvi - (3.3.15) 
i 


This matrix is positive-definite (in the sense that, for any set of real numbers 
Ea, we have J` p Mapa& > 0, unless all £4 vanish), and therefore it has 
an inverse M~. We can thus invert the relation (3.3.14), and find wg = 
$, Mz, ns /f (T). Using this in Eq. (3.3.13) gives 


ni — ñi = J Ši dai May "o (3.3.16) 
ab 


for any particle species 7. Note in particular that if the densities np of all 
the conserved quantum numbers vanish, then there is an equal number of 
particles and antiparticles of every kind, as mentioned above. 

In order to deal with the case where some conserved quantities such as 
B — L have non-zero densities, we need to calculate the matrix Map. The 
particles of the Standard Model are listed in Table 3.4. For Ng generations 
of quarks and leptons and N4 scalar doublets, the independent elements of 
the matrix Map are 


13 Ng O 8N 10 Ng 
T 3 


MB-L B-L = 
(3.3.17) 
We don’t need any of the matrix elements involving T3, because the sum of 
the T3 values vanishes for all the particles with any given values of B — L 
and Y, so that Mg-L 7; = My 7, = 0. That is, the matrix Ma is block- 
diagonal, with a 2 x 2 block having a and b running over B — L and Y. 
This has the consequence that without bothering to calculate M7, 7,, we 
can calculate that 
re 10N, Na i 8 Ng -1 _ 13N, 
B-L B-L~ 3p D’ B-LY “apy: ? PY 3p 3 
(3.3.18) 
where D is the determinant 


PE 22N? , BNNa 
3 3 o 


(Similarly, because the sum of the color quantum numbers vanishes for all 
the particles with any given values of B — L and Y, we do not need to take 


(3.3.19) 
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Table 3.4: Particles of the Standard Model, together with the number g of their helicity 
and color states (with an extra factor 2 for bosons), and the values of their baryon number, 
lepton number, and gauge quantum numbers. Only one “generation” of quarks and leptons 
and only one doublet of scalar fields are shown. The subscripts L and R denote the helicity 
states of quarks u and d and leptons v and e. Antiparticles are not shown separately, and 
the photon and Z 0 are not shown because they are their own antiparticles, and so do not 
contribute to the densities of any quantum numbers. Color quantum numbers are not shown, 
for reasons given in the text. 


Particle g B L T3 Y 
uL 3 1/3 0 1/2 —1/6 
dL 3 1/3 0 —1/2 —1/6 
UR 3 1/3 0 0 —2/3 
dR 3 1/3 0 0 1/3 
VL 1 0 1 1/2 1/2 
eL 1 0 1 —1/2 1/2 
eR 1 0 1 0 1 
W+ 4 0 0 1 0 
gr 2 0 0 1/2 —1/2 
g? 2 0 0 —1/2 —1/2 

gluons 4 0 0 0 0 


the color quantum numbers into account here.) Thus if B — L is the only 
conserved quantum number with a non-vanishing number density, then 
Eqs. (3.3.16) and (3.3.18) tell us that the baryon number density in thermal 
equilibrium is 


ng = X Bi(ni — ñi) = > Bi (B — L)iMg L B-L + YıMy'p-1 )"B-L 


t l 
4 E 
F MRE B-L a B-L | Ng nB-L 


N ee ee es _ 3.3.20 
(ei) G ae 
For instance, in the minimal experimentally allowed case, with Ng = 3 and 


Na = 1, this gives ng = (28/79)ng_y. In any case, ng turns out to be of 
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the same order of magnitude as ng_,;, as anticipated above. The reason for 
the smallness of ng/n, in this scenario would be traced to the smallness of 
CP violation in the out-of-equilibrium heavy particle decay that produces 
a non-vanishing density of B — L. 


4. Affleck—Dine mechanism!>° 


It is possible for baryon number nonconservation to occur in the 
nonequilibrium dynamics of a scalar field that carries a non-zero baryon 
number. Both baryon-number conservation and CP-invariance need to 
be violated in the Lagrangian of the scalar field. Such theories find their 
motivation in supersymmetry, which lies outside the scope of this book. 


5. Equilibrium baryon synthesis 
The violation of CPT by the expansion of the universe means that the part 
of the Hamiltonian that is odd in C and CP can have a non-zero expectation 
value AZ; in the state of a single baryon of type i, and of course opposite 
expectation value — A£; in the state of the corresponding antibaryon. Then 
even the universe starts in a state of zero baryon number, if it enters a state of 
thermal equilibrium in which baryon-number nonconserving processes occur 
rapidly, although there will be no chemical potential associated with baryon 
number, the difference in energy of baryons and antibaryons will lead to a 
net baryon number. When baryon-number nonconserving processes become 
ineffective the resulting baryon number density will survive, simply decreas- 
ingasa~*. Aslongas baryon-conserving collisions remain sufficiently rapid, 
the one-particle distribution will have the form appropriate for thermal equi- 
librium, but now with a non-vanishing baryonic chemical potential. 

As a general class of theories of this sort, suppose that in the expanding 
universe, there is a term in the Lagrangian density of the form 


AL(x) = -y —Detg V(x) J5 (x), (3.3.21) 


where V,,(x) is a classical vector field and J 5 (x) is the current associated 
with baryon number (for which JL) is the baryon density, which is odd 
under C and CP). Two proposals of this sort have been made: the vector 
field could be V, = M~'!d,,, where ọ is some scalar field! and M is 
some large mass; or the vector field could be V, = M -23 R, where R is 
the curvature scalar!” and again M is some large mass. In any case, the 
isotropy and homogeneity of the Robertson—Walker metric requires that 


15L, Affleck and M. Dine, Nucl. Phys. B 249, 361 (1985); M. Dine, L. Randall, and S. Thomas, Phys. 
Rey. Lett. 75, 398 (1995); Nucl. Phys. B 458, 291 (1996). 

16A. G. Cohen and D. B. Kaplan, Phys. Lett. B 199, 251 (1987). The production of baryon number 
is suppressed when the scalar field oscillates rapidly; see A. Dolgov, K. Freese, R. Rangarajan, and 
M. Srednicki, Phys. Rev. D 56, 6155 (1997) [hep-ph/9610405]. 

17H, Davoudiasl, R. Kitano, G. D. Kribs, H. Murayama, and P. J. Steinhardt, Phys. Rev. Lett. 93, 
201301 (2004). 
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V; vanishes, while Vp is a function only of time. This interaction then shifts 
the energy of the state of a single particle of type i with baryon number b; by 
an amount A&E; = Vo(t) ;. In thermal (including chemical) equilibrium, 
the baryon number density will be 


Amp? dp 24.m?+b;Vo(t))/k = 
ng(t) = J bisi R fe Pm HOD kBTO cell 8.3.22 
l 


where the sum over i runs over all particle (and antiparticle) types; g; and 
bi are the number of spin states and the baryon number of a particle of 
type i; and the upper and lower signs again apply to bosons and fermions, 
respectively. If baryon non-conserving collisions shut off suddenly at time 
tı, then subsequently the baryon number density will be 


3 2 -1 
t Arp d 
a Gay Db Gx [e E oe J 


i 


(3.3.23) 


For Vo = 0 the cancelation between baryons and antibaryons of course 
makes this vanish. Since the baryon/entropy ratio is known to be small, we 
expect Vo(t1) to be small; to first order in Vo(t1), the baryon number density 
for t > tı will be 


_ (ay V aa, 
no=() kpT() £ bigi 


—2 
Anp? dp /P+m/ke Tin |, pPe+milkeT(n) 11 (3.3.24) 
rh) 


A similar mechanism could also be responsible for lepton synthesis. 


The crucial confirmation of any theory of baryon synthesis would be a 
successful prediction of the present baryon/photon ratio. So far, none of 
the proposals discussed here are anywhere near this goal. 


3.4 Cold dark matter 


We saw in Section 3.2 that considerations of cosmological nucleosynthesis 
lead to the conclusion that most of the mass in the universe is not in the form 
of ordinary baryonic matter, i.e. atomic nuclei and electrons. We will see in 
Chapter 7 that this conclusion is powerfully reinforced by observations of 
anisotropies in the cosmic microwave background. So we face the question, 
ifthe particles that make up most of the mass of the universe are not baryons, 
then what are they? 
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We know that this matter is dark, in the sense that it does not interact 
significantly with radiation, both because we don’t see it, and also because it 
has not lost its kinetic energy sufficiently to relax into the disks of galaxies, 
as has baryonic matter. This means in particular that these particles must 
be electrically neutral.! Detailed studies of the dynamics of galaxy clusters 
indicate that the dark matter particles must also be cold, in the sense that 
their velocities are highly non-relativistic. 

The study of a double galaxy cluster 1E0657-558 (the “bullet cluster,” 
with z = 0.296) has provided vivid direct evidence of the existence of dark 
matter, which does not have non-gravitational interactions with itself or with 
ordinary baryonic matter.* The galaxies in this cluster are mostly grouped 
into two distinct subclusters, while hot gas (observed through its emission of 
X-rays) is concentrated between these subclusters. The interpretation is that 
two clusters of galaxies have collided; the galaxies which have little chance 
of close encounters have mostly continued on their original paths, while 
the two clouds of hot gas that previously accompanied them have collided 
and remained closer to the center of the double cluster. The total matter 
density in 1E0657-558 is mapped out through its effect in gravitationally 
deflecting light from more distant galaxies along the same line of sight. 
(Gravitational lensing is discussed in Chapter 9.) In this way, it is found that 
most of the matter in 1E0657-558 is not associated with the hot gas, but like 
the galaxies forms two subclusters that have evidently passed through each 
other without appreciable interaction. The ratio of the mass in hot gas to 
the mass in all matter is estimated to be about 1/6, in line with the value of 
g/m previously inferred from measurements of deuterium abundance 
and luminosity distance as a function of redshift, or from anisotropies in 
the cosmic microwave background. 

Elementary particle theory offers several candidates for the particles 
making up the cold dark matter. 


A. Weakly interacting massive particles (WIMPs) 


Massive particles may survive to the present if they carry some sort of 
conserved additive or multiplicative quantum number. If there is a non-zero 


'There are particularly strong limitations on the number density of any sort of charged stable particles 
that might be left over from the big bang, which are set by mass spectroscopy, the analysis of the charged 
particles contained in samples of matter according to their ratio of mass to charge. The number of 
electrically charged exotic particles with masses in the range of 6 GeV to 330 GeV has been found to be 
less than 1072! of the number of nucleons, by P. F. Smith and J. R. J. Bennett, Nucl. Phys. B 149, 525 
(1979). 

2P JE. Peebles, Astrophys. J. 263, L1 (1983); G. R. Blumenthal, S. M. Faber, J. R. Primack, and 
M. J. Rees, Nature 311, 517 (1984). 

3D. Clowe et al., Astrophys. J. 648, L109 (2006) [astro-ph/0608407]. 
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chemical potential associated with this quantum number, then of course 
some particles (or antiparticles) must be left over after all the antipar- 
ticles (or particles) have annihilated. But even if there are no non-zero 
chemical potentials for these particles, so that the initial number densi- 
ties of particles and antiparticles are equal, if they can only annihilate 
with their antiparticles then once their number density becomes sufficiently 
low the collision rate eventually becomes too small to reduce the density 
further.+ We will call these particles L-particles (for “left-over”), to distin- 
guish them from the other particles into which they may annihilate, which 
we will assume are all approximately in thermal and chemical equilibrium 
during the period of annihilation. The annihilation rate per particle of 
the L particles and antiparticles is n(ov), where n is their number den- 
sity, and (ov) is the average value of the product of annihilation cross 
section and relative velocity. The rate of decrease in the number of L 
particles in a co-moving volume a? is then na? x n(ov). There is also 
an n-independent rate of creation of these particle—antiparticle pairs from 
the thermal background. Since this must balance the annihilation rate 
when everything is in equilibrium, in general the creation rate per volume 


a? must equal nga? (ov), where neq is the number density of L particles 


and of antiparticles in equilibrium. The number na? of L particles and of 
antiparticles in a co-moving volume a’ is therefore governed by a Boltzmann 
equation 


3 
ae ys —(w 5 Me) a (ov) l (3.4.1) 


For very high temperatures with kgT >> my the equilibrium density neq 
varies as T?, and T varies as 1/a, so Eq. (3.4.1) has a solution n = Neq: 
Eventually, with the decrease in temperature below the L-particle mass, 


the equilibrium density drops so low that the creation term in Eq. (3.4.1) 
becomes negligible, and we have 


3 
au bas SOG (3.4.2) 
The solution of Eq. (3.4.2) is 
z = constant +f jens 
n(tyaa(t) eae 


4B. W. Lee and S. Weinberg, Phys. Rev. Lett. 39, 165 (1977); D. D. Dicus, E. W. Kolb, and V. 
L. Teplitz, Phys. Rev. Lett. 39, 168 (1977); E. W. Kolb and K. A. Olive, Phys. Rev. D 33, 1202 
(1986). 
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or, in other words, 


n(t1)a>(t1) 
L+n(t) a(t) fy (ov) a) dt! 


nD (t) = 


where ż; is any convenient time chosen late enough so that the creation term 
in Eq. (3.4.1) may be neglected for t > t4. 

The important point here is that the integral in the denominator con- 
verges for £ —> oo. The denominator a(t) increases like t?/? when the 
energy density is dominated by relativistic particles, and even faster later, 
when it is dominated by non-relativistic particles and/or vacuum energy. 
Also, if annihilation is possible from states of zero orbital angular momen- 
tum then ov approaches a constant for low energies, so its thermal average 
(ov) approaches a constant for low temperatures, and hence for late times. 
The contribution of states of higher orbital angular momentum decreases 
with decreasing temperature, so if s wave annihilation is forbidden by selec- 
tion rules the integral converges even faster. Because the integral converges, 
the particle number in a co-moving volume a? approaches a finite limit: 


n(t)a°(t1) 
1+ n(ty) ÈC) f° (ov) a3) de 


nD (t) > (3.4.3) 


Let us assume that the annihilation of L particles and antiparticles took 
place during a time when the density of the universe was dominated by 
relativistic particles, so that a « 1/T, and the time is given by Eq. (3.1.14): 


3 dT 
sag A E 
a 16r GNag T? ’ 


or, using the formula ag = x’kg/15 withfi=c=1, 


45 _7dx 


where x = kgT/m_. Eq. (3.4.1) therefore takes the form 


du(x) 
TA = B|) — 14] (3.4.5) 
where u(x) is the dimensionless quantity of interest 
u=n/(kpT) , (3.4.6) 


ueq(kgT /mz) is its equilibrium value, and B is the dimensionless parameter 


|) 45 
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We see that the left-over value of n/(kg T)? depends only on B, and on the 
spin of the L particles, which we need to know to give a formula for ueq (x): 


af Any? dy 
(27)? Jo expyx2+y2+1 


where y = p/kgT, and as usual the sign is +1 for fermions and —1 for 
bosons. 

For instance, for kgT « mz the heavy particles are non-relativistic, and 
the mean value of the low-energy annihilation cross section times velocity 
is a constant, 


ueq (x) = (3.4.8) 


(ov) = Gmi F/27 , (3.4.9) 


where Gwg = 1.1664 x 1075 GeV~? is the weak coupling constant, and F is 
a fudge factor to take account of the number of annihilation channels and 
the details of the interaction responsible for the annihilation. This gives 


3 
B = 1.59 x 108 (mı [GeV]) FN". (3.4.10) 


The solution of Eq. (3.4.5) for various values of B and sz = 1/2 is shown in 
Figure 3.2. We see that u(x) drops steeply from a constant B-independent 
value for x > 1 to a constant B-dependent value for x < 0.01. 

For mz in the GeV range B is quite large, and for such values of B (and 
sL = 1/2) the asymptotic value of u is well approximated by 


u(0) ~ 6.1 B~?> (3.4.11) 


B=10° 


| B=10!0 
| B=10!! 


2 s : > X 
107 107 107! 1 10 


Figure 3.2: The function u(x) = n/(kg TP vys x= kgT/m_, for a dark matter particle of 
spin 1/2 and mass mz , with various values of the parameter B defined by Eq. (3.4.7). 
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(A value u(0) x B7! is what we would expect if the second term in the 
denominator in Eq. (3.4.3) were much greater than unity.) The present 
mass density of these heavy particles and antiparticles is then (recalling that 
ect OF boston annihilation increases the photon temperature by a factor 
11/4 >) 


pL = 2mru(0)(4/11)(kg T0)? 
= (2.15 x 10-*8g/em*) (mGev)) (FINN) ” , (8.4.12) 


or in other words 
5 os -1.85 —0.95 
Q = 8r GpL/3 Hè = 11.5h (mi(GeV)) (FINN) . (3.4.13) 


Note that Qz is a decreasing function of mz, because heavy L particles 
annihilate more effectively than light ones. If we assume that the left-over 
heavy particles make up most of the cosmic mass density, then Qy ~ QL, 
and so 


mi (F/NS! ~ 3.7 (Qyh?)~™ GeV . (3.4.14) 


which is 10 GeV for Qyh* = 0.15. Otherwise this provides a lower bound 
on the mass of these particles. 

Incidentally, similar arguments apply to the annihilation of nucleons and 
antinucleons, if the universe has zero net baryon number. The difference 
here of course is that instead of (ov) being of order Gzmij, it is roughly of 
order m=?. We can thus estimate the density of left-over baryon—antibaryon 
pairs by taking mg = my, N ~ 10, and replacing F in Eq. (3.4.13) with 
2nmz?/Gpmy = 2.7 x 10!?, so that Eq. (3.4.13) would give a baryon- 
antibaryon density parameter Ng, ph? ~ 6 x 10711. This is much less than 
the present observed density parameter of baryons, ruling out the possibility 
that the baryons around us are just those that missed being annihilated in an 
initially baryon—antibaryon symmetric universe, and then somehow became 
segregated from the antibaryons. 

Returning to the cold dark matter, originally it was thought that the 
L particles might be heavy neutrinos. They could not be any of the three 
known neutrino types, which as discussed at the end of Section 3.1 have 
masses at most of the order of 1 eV, but there could be a fourth generation 
of very heavy leptons that have negligible mixing with the known leptons. 
The mass of a new heavy neutrino would have to be greater than mz/2 = 45 
GeV, to block the decay of the Z? into a heavy neutrino and antineutrino, 
which if it occurred would destroy the present excellent agreement between 
theory and experiment for the total decay rate of the Z? particle. If the 
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L particles were heavy neutrinos, then the fudge factor F in Eq. (3.4.9) 
would be of the order of the number of types of particles into which the L 
particles might annihilate. If we assume very roughly that F ~ MN ~ 100, 
then Eq. (3.4.13) gives zh? < 1073. So it does not seem that the cold dark 
matter could consist of a new heavy neutrino. 

The most plausible candidate for the L particle is one of the new particles 
required by supersymmetry.” In many supersymmetric theories there is a 
multiplicatively conserved quantum number R, which takes the value +1 
for all the known particles of the Standard Model, and —1 for their super- 
symmetric partners. (Multiplicative conservation means that the product 
of the Rs for all the particles in the final state of any reaction is the same 
as for the initial state.) Among other things, this conservation law tells us 
that the lightest particle with R = —1 (which is often called the LSP, for 
“lightest supersymmetric particle”) is stable, although two of these particles 
could annihilate into ordinary particles with R = +1. To judge which is 
the lightest particle with R = —1, it is necessary to distinguish between two 
possible pictures of supersymmetry breaking.® In both pictures supersym- 
metry is spontaneously broken by non-perturbative effects in some hidden 
sector of particles with a large typical mass Ms, which interact through 
some strong force that is not felt by the known particles of the Standard 
Model or their superpartners. The supersymmetry breaking in the hidden 
sector also gives the gravitino (the superpartner of the graviton with spin 
3/2) amass mg © vV GMZ. 

In one picture of supersymmetry breaking, the breakdown of super- 
symmetry is communicated to the particles of the Standard Model and 
their superpartners by the electroweak and ordinary strong forces of the 
Standard Model. In this case Ms would have to be of the order of 100 
GeV to 100 TeV, and the gravitino mass would be at most of order 1 eV. 
The lightest particle with R = —1 would be the gravitino, which would 
be too light to furnish the cold dark matter. Even if gravitinos were once 
in equilibrium with other particles, and did not annihilate or decay, by 
the same arguments as in Section 3.1 their number density now would 
be less than the number density of photons by a factor of order 2/N, 
where M is the effective number of relativistic particle states at the time 
that gravitinos went out of thermal equilibrium. Photons with typical 
energy kgT,o have an energy density parameter given by Eq. (2.1.8) as 
Qyh? = 2.47 x 1075, so in order for gravitinos to furnish cold dark matter 
with Qyh* ~ 0.15, the mass of the gravitino would have to be of order 


>This and other possibilities are discussed in a comprehensive review by G. Bertone, D. Hooper, and 
J. Silk, Phys. Rep. 405, 279 (2005) [hep-ph/0404175]. 
See QTF, Vol. II, Sec. 28.3. 
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0.15 N kgT,0/2Q,h’, which for M ~ 100 is roughly 70 eV.’ Thus in this 
picture of supersymmetry breaking, gravitinos are not likely to be the cold 
dark matter, though this is not absolutely ruled out. The next-to-lightest 
particles with R = —1 would all presumably decay over billions of years into 
gravitinos and ordinary particles, so they could not furnish the cold dark 
matter either. (The rate of decay into relativistic gravitinos with helicity 
+3/2 would be suppressed by a factor G, but particles with R = —1 could 
decay into gravitinos with helicity +1/2 with the decay rate suppressed only 
bya factor My 2 and by some powers of the gauge couplings of the Standard 
Model.) 

In the other picture, supersymmetry breaking is mediated by the gravi- 
tational field and its superpartners, and because these interactions are very 
weak Ms must be correspondingly large, of order 10!! to 10!3 GeV. The 
gravitino mass would be of the same order of magnitude as the masses of 
the superpartners of the known particles of the Standard Model, so it might 
or might not be the lightest particle with R = —1. 

If the lightest particle with R = —1 is not a gravitino, then it could be 
either a sneutrino, the spin 0 superpartner of the neutrino, or a neutralino, 
the spin 1/2 superpartner of some mixture of the neutral gauge and scalar 
bosons of the Standard Model. Whichever of these is the lightest, the decay 
process will involve the exchange of a particle with R = —1, having a mass 
M that is expected to be of the order of a TeV or so. Hence its annihilation 
amplitude (aside from factors of 2 and 7) will be of order g?/M7, where g 
is a typical electroweak coupling. This is smaller than the weak coupling 
constant Gy, by a factor of order My /M7, so the fudge factor F in the 
annihilation rate constant (3.4.9) will be of order myN. (4 /M* = 10-4Ny, 
where M4 is the number of annihilation channels. Taking N4/ VN ~ 1, 
Eq. (3.4.14) tells us that to furnish cold dark matter with Rmh? ~ 0.15, the 
lightest particle with R = —1 (if not the gravitino) would have to have a 
mass of order 1 TeV. This is similar to estimates of the masses of the super- 
partners of the particles of the Standard Model in typical supersymmetric 
models, a circumstance that greatly encourages the hope that the particles 
of dark matter will be found to be created in experiments at high energy 
accelerators. 

Cosmological considerations rule out the possibility that the lightest 
particle with R = —1 in this picture of supersymmetry breaking is the grav- 
itino. Unlike the superpartners of the particles of the Standard Model, the 


TH. Pagels and J. R. Primack, Phys. Rev. Lett. 48, 223 (1982) first remarked that in order for the 
gravitino not to give Qyy > 1, its mass would have to be less than about | keV. 
8s. Weinberg, Phys. Rev. Lett. 50, 387 (1983). 
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gravitino has a two-body annihilation amplitude proportional to two fac- 
tors? of the gravitational coupling constant vG, so the annihilation rate is 
proportional to G?, and is therefore much too small for annihilation to play 
any significant role in reducing the number density of gravitinos. If, because 
of particle annihilations after gravitino decoupling, the number density of 
gravitinos were now, say, 1% of the number density of photons, then, as 
we have seen, in order for their mass density now not to exceed cosmolog- 
ical bounds their mass would have to be less than roughly 100 eV. This is 
quite inconsistent with the gravitino masses expected in theories of gravity 
mediated supersymmetry breaking. But if the gravitinos are not the lightest 
particles with R = —1 then they can decay, reducing their present mass den- 
sity to acceptable values even if their masses are quite high.!° The coupling 
of the gravitino to other fields is proportional to V/G, so on dimensional 
grounds the decay rate I’, of a gravitino at rest is roughly of the order of Gm. 
This is to be compared with the rate of expansion of the universe, which at 
temperature T is of order y G(kgT)*. (Weare here ignoring factors of order 
10-100, including those involving non-gravitational coupling constants and 
the number of particle species.) When the cosmic temperature drops to the 
value kgT % mg at which gravitinos become non-relativistic, the ratio of 
their decay rate to the expansion rate is of order ~ Gmg = Mg /MPlanck << 1, 
so gravitino decay becomes significant only after this time, when the graviti- 
nos are highly non-relativistic. As we have seen, their number density will 
be of order (kg T)’, so their energy density will then be of order Mg(kgT )3, 
which is greater than the energy density of order (kgT)* of the photons 
and other relativistic particles in thermal equilibrium at temperature 7, 
and therefore makes the dominant contribution to the cosmic gravitational 
field that governs the rate of expansion of the universe. The expansion rate 


under these conditions is therefore of order ,/Gmg(kgT)?, and gravitino 


decay becomes significant when this equals the gravitino decay rate of order 
Gm, and therefore at a temperature 


kez © G' 8 mi? 


After they decay, their energy must go into the energy of photons and 
other relativistic particles, so the temperature T y after decay is related to 
the temperature Tọ calculated above by the energy conservation condition 


This does not include a factor of /G in the gravitino mass, because this factor is multiplied by the 
scale of supersymmetry breaking, which for gravitationally mediated supersymmetry breaking is very 
large, giving a gravitino mass that is comparable to the mass of other supersymmetric particles. 

10S, Weinberg, Phys. Rev. Lett. 48, 223 (1982). 


193 


3 The Early Universe 
m,T? ~ T'4, and hence 
E- g g? 
1 me G14 3/2 
kpT, G m°. 


In particular, since kgTg << mg, we have T a >> T,. If Tọ were less than 
the temperature T, ~ 0.1 MeV at which cosmological nucleosynthesis can 
occur, then gravitinos would still be abundant before nucleosynthesis, giving 
a higher energy density and hence a faster expansion, so that there would be 
less time for free neutrons to decay before being incorporated into complex 
nuclei, and hence more helium would be produced when nucleosynthesis 
occurs. Also, the ratio of the photon and baryon densities would have 
been subsequently increased by gravitino decay, so this ratio at the time of 
nucleosynthesis would have been considerably less than is usually estimated 
from the present cosmic microwave background temperature, and so nuclear 
reactions would have incorporated neutrons more completely into helium, 
and less deuterium would be left today. The present agreement between 
theory and observation for the cosmic helium and deuterium abundances 
would thus be destroyed. This problem is avoided if T, > 0.1 MeV, but it 
can also be avoided under the much weaker condition that T 6 > 0.4 MeV, 
because then after the gravitinos decay the temperature would have been 
high enough to break up the excess helium and give cosmological nucle- 
osynthesis a fresh start as the universe recools. This condition requires that 
mg > 10 TeV. This limit on mg corresponds to a supersymmetry breaking 
scale Ms > 10!! GeV for mg ~ /GM&. 

It may be possible to detect cosmic WIMPs through observation of the 
recoil of atomic nuclei from which they scatter elastically,!! as for instance 
the scattering of heavy neutrinos through the neutral current weak inter- 
action.!? This is being pursued by a number of collaborations: DAMA,!3 
CRESST,!4 EDELWEISS,!> UK Dark Matter,!® CDMS,!” and WARP. 
Assuming that WIMPs are more or less at rest in the halo of our galaxy, the 
motion of the solar system through the halo produces a WIMP “wind” with 


1 For reviews of current experiments, see Y. Ramachers, Nucl. Phys. B. Proc. Suppl. [astro-ph/ 
0211500]; G. Chardin, in Cryogenic Particle Detection, ed. C. Ens (Springer, Heidelberg, 2005) [astro- 
ph/0411503]; R. J. Gaitskell, Ann. Rev. Nucl. Part. Sci. 54, 315 (2004); J. Ellis, K. Olive, Y. Santoso, and 
V.C. Spanos, Phys. Rev. D 71, 095007 (2005) [hep-ph/0502001]; K. Freese, Nucl. Instrum. Meth. A559, 
337 (2006) [astro-ph/0508279]; L. Baudis, Int. J. Mod. Phys. A21, 1925 (2006) [astro-ph/0511805]. 
2A. Drukier and L. Stodolsky, Phys. Rev. D 30, 2295 (1985); M. Goodman and E. Witten, Phys. 
Rey. D 31, 3059 (1985). 
13k. Bernabei et al.,Phys. Lett. B480, 23 (2000). 
4G. Angloher et al., Astropart. Phys. 23, 325 (2005) [astro-ph/0408006]. 
15y, Sanglard et al., Phys. Rev. D 71, 122002 (2005) [astro-ph/0503265]. 
6G. J. Alner et al., Astropart. Phys. 23, 444 (2005). 

TD. S. Akerib et al., Phys. Rev. Lett. 93, 211301 (2004) [astro-ph/0405033], and astro-ph/0507190. 
8P Benetti et al., astro-ph/0701286. 
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a speed vy of about 220 km/s. A nucleus of mass Amy when struck with a 
WIMP of mass m, >> Amy traveling at this velocity will recoil with a typical 
velocity of order vwy, and hence a kinetic energy of order Amy v2, /2 ~ 100A 
eV. These recoils can be detected by observing the ionization of atoms struck 
by the recoiling nucleus, or by detecting light emitted by these atoms, or by 
detecting vibrations in the crystal lattice of the detector. The mass density 
pn in the halo of our galaxy near earth is estimated from observations of 
stellar motions to be about 0.3 GeV/cm?, giving a number density for WIMP 
mass Mw Of pn/Mw, from which the elastic scattering rate can be calculated 
for any assumed values of the WIMP mass and the scattering cross section. 
Failing to observe nuclear recoil events then excludes some region of the 
My—cross-section plane. 

The greatest problem in these experiments is distinguishing true WIMP 
events from background, caused by natural radioactivity and cosmic rays. 
The best hope for distinguishing events from background is to exploit the 
motion of the earth around the sun.!? This orbital motion adds about 
15 km/s to the speed of the WIMP wind in summer, and subtracts an 
equal amount in winter (though this depends on the halo model), so one 
may expect a 7% seasonal modulation of true WIMP events. The DAMA 
collaboration reported just such a modulation, but almost all of the region 
in the m,,—cross-section plane that would account for this observation was 
subsequently apparently excluded (using a different detection scheme) by 
the CDMS collaboration. So far, experiments set an upper bound on the 
effective cross section of about 1074 to 107% cm? for m, > 50 GeV, and 
much larger form, < 50 GeV, the precise bound depending on assumptions 
about the distribution of WIMPs in the galactic halo.?? (For comparison, 
the effective cross section for the low energy scattering of the neutralinos 
of supersymmetric theories on nucleons is expected to be less than about 
1074 cm?.) 

There is also a possibility of an indirect detection of WIMPs, through 
observation of gamma rays or other particles produced when pairs of WIMPs 
annihilate in regions of high WIMP concentration.*! Gamma rays do not 
penetrate the earth’s atmosphere, so they have to be detected either through 
the observation of showers of charged particles produced in the atmosphere, 
as for instance by the Cerenkov radiation associated with these particles, 
or by gamma ray telescopes carried by balloons or by satellites in orbit 
above the earth’s atmosphere. If WIMP-WIMP annihilation produces 


DAK. Drukier, K. Freese, and D. N. Spergel, Phys. Rev. D 33, 3495 (1986); K. Freese, J. A. Frieman, 
and A. Gould, Phys. Rev. D 37, 3388 (1988). 

20. Bottino, F. Donato, N. Fornengo, and S. Scopel, Phys. Rev. D 72, 083521 (2005) [hep-ph/ 
0508270]. 

21 For a review, see P. Gondolo, hep-ph/0501134. 
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just a pair of gamma rays, then each will carry a unique energy equal 
to the WIMP mass, giving a very clear WIMP signal. So far, there has 
been no sign of such a monochromatic gamma ray. Of course the annihi- 
lation may produce other particles along with gamma rays, or particles 
whose decay then produces gamma rays, but in either case this would 
yield a continuum of gamma ray energies, which would be much harder 
to identify as coming from WIMP annihilation. The Cangaroo-II*? and 
HESS”? atmospheric Cerenkov detector collaborations have both reported 
continuum sources of gamma rays coming from near the center of our 
galaxy, but with very different spectra, which can be interpreted as com- 
ing from annihilation of WIMPs with a mass of about 1 TeV or 19 TeV, 
respectively. 

For decades there has been evidence of monochromatic gamma rays 
coming from the galactic center, but at the energy of 511 keV expected from 
electron—positron annihilation, rather than WIMP-—WIMP annihilation. In 
1970 a balloon-borne pair of gamma ray detectors found evidence of a 
gamma ray line around 500 keV coming more-or-less from the direction of 
the galactic center.*4 The evidence for this has since become much stronger 
through observations made by the INTEGRAL (International Gamma- 
Ray Astrophysics Laboratory) satellite, which found a gamma ray line 
coming from the galactic center with an energy within about 3 keV of 511 
keV. It is possible that this is due to the decay of a relatively light WIMP into 
electron—positron pairs, with the positrons then losing energy by ionization, 
after which they annihilate with ambient electrons.7° 

There is also a report of an excess of positrons in cosmic rays found 
by the HEAT balloon experiment,”’ which might or might not come from 
WIMP annihilation. Very recently it has been suggested that an excess of 
microwave emission from the direction of the center of our galaxy observed 
by the WMAP satellite may come from synchrotron emission by relativistic 
electrons and positrons produced in WIMP annihilation.7® 

It is too early to reach any definite conclusions from any of these exper- 
iments about WIMPs as candidates for the particles of dark matter. 


22K, Tsuchiya et al., Astrophys. J. 606, L115 (2004) [astro-ph/0403592]. 

23D, Horns, Phys. Lett. B 607, 225 (2005). But see F. Aharonian,Phys. Rey. Lett. 97, 221102 (2006) 

24W, N. Johnson, III, F. R. Harnden, Jr., and R. C. Haymes, Astrophys. J. 172, L1 (1972.) 

25p Jean et al., Astron. Astrophys. 407, 55 (2003) [astro-ph/0309484]; J. Knédlseder et al., Astron. 
Astrophys. 411, 457 (2003) [astro-ph/0309442]; E. Churazov et al., Mon. Not. Roy. Astron. Soc. 
357, 1377 (2005) [astro-ph/0411351]; J. Knédlseder et al., Astron. Astrophys. 441, 513 (2005) [astro- 
ph/0506026]; G. Weidenspointer et al., Astron. Astrophys. 450, 1013 (2006) [astro-ph/0601673]. 

26C, Boehm, D. Hooper, J. Silk, M. Casse, and J. Paul, Phys. Rev. Lett. 92, 1301 (2004) [astro-ph/ 
0309686]. 

278, W. Barwick et al., Phys. Rev. Lett. 75, 390 (1995); Astrophys. J. 482, L191 (1997). 

8p, Hooper, D. P. Finkbeiner, and G. Dobler, 0705.3655. 
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B. Axions and axinos 


2 are light neutral spinless particles that are made necessary”? by 


13! 


Axions 
the spontaneous breakdown of a symmetry that first appeared in a mode 
that was proposed to explain why non-perturbative effects of the strong 
interactions do not violate CP invariance. For our present purposes, all 
we need to know is that the dominant part of the effective action (with 
h = c = 1) that describes the axion field g takes the form (B.44) with 
potential V(~) = m2y7/2: 


1 1 
Ilg] = J d*x,/—Detg (323.0 dye — smi?) (3.4.15) 
The axion mass ma is a complicated function of temperature, but for tem- 
peratures well below 10!? K it takes a well-known constant value, related to 
the energy scale M at which the Peccei-Quinn symmetry is broken by 


Fym,z ./Mqmy a 13 MeV 


a , 3.4.16 


Ma) = 


(Here Fy = 184 MeV is the pion decay amplitude, and mq and m, are 
the down and up quark masses appearing in the Lagrangian of quantum 
chromodynamics, for which mg/m, ~ 1.85.) In the original Peccei-Quinn 
model,?! M was of the order of the electroweak symmetry breaking scale, 
M ~ 100 GeV, but it was soon realized that this is experimentally ruled 
out. Axion fields interact with ordinary matter through factors 0,,0/M, 
so with M > 100 GeV axions interact so weakly that they emerge without 
attenuation from reactor cores or stellar interiors. The production rate of 
axions is proportional to 1/M7, so the failure to observe effects of axion 
emission from stars or nuclear reactors sets a lower bound on M, and hence 
an upper bound on mg. In particular, limits on the rate of cooling of red 
giant stars by axion emission give? M > 10’ GeV, while observations of 
the supernova SN1987A indicate*? that M > 10!° GeV. A generalization of 
the Peccei-Quinn model was then proposed,*4 in which M is an arbitrary 


29See QTE, Vol II, Sec. 23.6. For a review, see P. Sikivie, in Axions — Lecture Notes on Physics, ed. 
M. Kuster (Springer-Verlag, to be published) [astro-ph/0610440]. 

305, Weinberg, Phys. Rev. Lett. 40, 223 (1978); F. Wilczek, Phys. Rev. Lett. 40, 279 (1978). 

SIR, D. Peccei and H. Quinn, Phys. Rev. Lett. 38, 1440 (1977); Phys. Rev. D 16, 1791 (1977). 

32D, A. Dicus, E. W. Kolb, V. I. Teplitz, and R. V. Wagoner,Phys. Rev. D 18, 1829 (1978); Phys. Rev. 
D 22, 839 (1980). 

33For reviews, see M. S. Turner, Phys. Rep.. 197, 67 (1990); G.G. Raffelt, Phys. Rep. 198. 1 (1990); 
P. Sikivie, ref. 26. 

34). E, Kim, Phys. Rev. Lett. 43, 103 (1979); M. Dine, W. Fischler, and M. Srednicki, Phys. Lett. 
104B, 199 (1981); M. B. Wise, H. Georgi, and S. L. Glashow, Phys. Rev. Lett. 47, 402 (1981). 
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parameter. As we shall see, cosmological considerations provide an upper 
bound?’ on M, which together with the above lower bounds leaves a narrow 
window of allowed values. 

The cosmological axion field is supposed to be spatially homogeneous. 
In a Robertson—Walker metric, the energy density and pressure are given 
by Eq. (B.66) and (B.67) as 


36 


1. 1 1. 
pa = se +M > Pa = 50 Maw (3.4.17) 
so the equation (1.1.32) of energy conservation (or the Euler-Lagrange 
equation derived directly from the action (3.4.15)) gives the field equation 


ğ+3HOH +m (Hy =0. (3.4.18) 


At early times H(t) >> m,(t), so we can ignore m,(t), and Eq. (3.4.17) has 
solutions gy = constant, and œ « 1/a?. Rejecting the singular solution, 
we see that at early times (t) is frozen at a value go, which (absent fine 
tuning) would be expected to be of order M. Later, when H(t) dropped 
below m,(t), y(t) began a rapid oscillation, so that in this case Eq. (3.4.18) 
can be solved using the WKB approximation, which gives 


3/2 t 
y(t) > g1 (2) cos (J Ma(t) dt + a) ; (3.4.19) 
a(t) 0 


where f] is the time at which H (t1) = mao, 91 is a constant of order go ~ M, 
and « is a phase that cannot be determined without a more detailed study, 
but which fortunately we do not need to know. (For instance, if ma(t) has 
the constant value mao and the universe is radiation-dominated in the era 
of interest, then gy) = 29/22~!/°1(5/4)gyo = 1.4460, and a = —3z/8.) 
Since a(t) for ma >> H was varying much more slowly than the phase of the 
cosine, the energy density at late times is given by Eq. (3.4.17) as 


1 3 
pal) > sme) (2 ) , (3.4.20) 


In order to project this forward to the present, we note that if the universe 
was radiation-dominated at time ft}, with M the effective number of types 
of particles with masses much less than kg T (t1) (counting each spin state 


35J, Preskill, M. B. Wise, and F. Wilczek, Phys. Lett. 120B, 127 (1983); L. F. Abbott and P. Sikivie, 
Phys. Lett. 120B, 127 (1983); M. Dine and W. Fischler, Phys. Lett. 120B, 137 (1983). For reviews, see 
J. E. Kim, Phys. Rep. 150, 1 (1987); M. S. Turner, Phys. Rep. 197, 68 (1990). 

36Components with non-zero wave number are doubtless present, but their energy density decays 
more rapidly than the energy density of the spatially homogeneous coherent field (£). 


198 


3.4 Cold dark matter 


of particles and antiparticles separately, and including an extra factor 7/8 
for fermions), then the photon temperature at present is 


Tyo = NIPT (tDaCt)/a(to) , 


the factor M n i being inserted to take account of the heating of photons 
by the annihilation of particles and antiparticles between times f; and to. 
The temperature T(t;) may be determined by noting that the expansion 
rate at time f is 


82 GN agT4 /2 4r? GN, 
mano” a e a 


Using these results in Eq. (3.4.20) then gives the present axion energy density 


i Ia 
a IT 
Palto) > zma N, M4 @2 (5) (kpT yo)? . (3.4.21) 


We expect go to be of the same order of magnitude as the symmetry breaking 
scale M, so using Eq. (3.4.16) and ignoring all factors of order unity, 


_ Fem. G4 (kgTy o) 
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(3.4.22) 


Equivalently, the axion density provides a fraction Q, of the critical density 
given by 


2379 
Qh? © (ma/1075eV) i (3.4.23) 


Because the axion field is spatially homogeneous, for mg >> H its energy 
takes the form of massive particles that are essentially at rest. If axions 
furnish the whole of the cold dark matter, then ma ~ 1075 eV, corresponding 
to M ~ 10!? GeV. Otherwise, these numbers provide a lower bound on ma 
and an upper bound on M. 

Axions are much too weakly interacting to be detected in the sort of 
nuclear recoils looked for in searches for WIMPs. One possibility is to 
observe the conversion of cosmic axions into photons in intense magnetic 
fields.?” This approach has already been used to put a limit on the param- 
eters of axions that would be produced by the sun.*® The axion field g(x) 
would be expected to have an interaction with the electromagnetic field of 


37P. Sikivie, Phys. Rev. Lett. 51, 1415 (1983); Phys. Rev. D 32, 2988 (1985). 
38K. Zioutas et al. (CERN Axion Solar Telescope collaboration), Phys. Rev. Lett. 94, 121301 (2005) 
[hep-ex/041 1033]. 
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the form ga, gE - B, with gay of order a/22M, so that photons in the 
sun could convert into axions in the presence of the strong electric fields 
around atomic nuclei, and these solar axions would then convert back into 
photons in intense laboratory magnetic fields. No such photons were seen, 
indicating that for axions with mass less than 0.02 eV, |gay| < 1.16 x 107!° 
GeV~—!, a much more restrictive limit than provided by earlier experiments 
of this sort.?? Ina different sort of search,® the Axion Dark Matter Exper- 
iment, a microwave cavity was used to search for axions in our galactic 
halo, and put upper limits on the axion density in the narrow mass range 
(1.98 to 2.17)x10~® eV. None of these experiments are in conflict with 
axion models of dark matter, but a plausible improvement in the sensi- 
tivity of this sort of experiment may rule out these models, or perhaps find 
axions. 

In supersymmetric theories, the axion would be partnered with a spin 
one-half particle, the axino, which would probably be the lightest particle 
with R = —1, and hence stable. Axinos could be produced non-thermally,*! 
through the decay of other particles with R = —1, or thermally.” It appears 
that the axino provides another plausible candidate for the particle of cold 
dark matter. 


39This and earlier experiments are reviewed by G. G. Raffelt, contribution to XI International 
Workshop on Neutrino Telescopes, hep-ph/0504152. 

vL, D. Duffy et al., Phys. Rev. D 74, 012006 (2006) [astro-ph/0603108], and earlier references cited 
therein. 

41L, Covi, J. E. Kim, and L. Roszkowski, Phys. Rev. Lett. 82, 4180 (1999). 

42L, Covi, H. B. Kim, J. E. Kim, and L. Roszkowski, J. High Energy Phys. 0105, 033 (2001) [hep- 
ph/0101009]; A. Brandenburg and F. D. Steffen, J. Cosm. & Astropart. Phys. 0408. 008 (2004) 
[hep-ph/0405158]. 
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Inflation 


We can have some confidence in the story of the evolution of the universe 
from the time of electron—positron annihilation to the present, as told in the 
previous three chapters. About earlier times, so far we can only speculate. In 
the past quarter century these speculations have centered on the idea that 
before the period of radiation domination, during which the Robertson— 
Walker scale factor a(t) was growing as ./t, there was an earlier period 
of inflation, when the energy density of the universe was dominated by a 
slowly varying vacuum energy, and a(t) grew more-or-less exponentially. 
The possibility of an early exponential expansion had been noticed by sev- 
eral authors,! but at first it attracted little attention. It was Alan Guth? who 
incited interest in the possibility of inflation by noting what it was good for. 

Guth noticed that, in a model of grand unification he was considering 
(with Henry Tye), scalar fields could get caught in a local minimum of the 
potential, which in his work corresponded to a state with an unbroken grand 
unified symmetry. The energy of empty space would then have remained 
constant for a while as the universe expanded, which would produce a 
constant rate of expansion, meaning that a(t) would have grown exponent- 
ially. Eventually this inflation would be stopped by quantum-mechanical 
barrier penetration, after which the scalar field would start rolling down the 
potential toward a global minimum, corresponding to the present universe. 
In itself this would have been a result of no great immediate importance. 
But then it occurred to Guth that the existence of an era of inflation would 
solve one of the outstanding problems of cosmology, mentioned here in 
Section 1.5. It is known as the “flatness problem:” Why was the curvature 
of space was so small in the early universe? Guth soon also discovered that 
inflation would solve other cosmological puzzles, some of which he had not 
even realized were puzzles. These problems along with the flatness problem 
will be discussed in Section 4.1. 

As Guth and others soon realized, his version of inflation had a fatal 
problem, to be described in Section 4.2. Guth’s “old inflation” was soon 
replaced with a “new inflation” model, due to Andrei Linde? and Andreas 
Albrecht and Paul Steinhardt.4 The essential element introduced by 


TAA, Starobinsky, JETP Lett. 30, 682 (1979); Phys. Lett. B91, 99 (1980); D. Kazanas, Astrophys. 
J. 241, L59 (1980); K. Sato, Mon. Not. Roy. Astron. Soc. 195, 467 (1981). 

2A. Guth, Phys. Rev. D 23, 347 (1981). Guth tells the story of this work in The Inflationary Universe: 
The Quest for a New Theory of Cosmic Origins (Helix Books/Addison Wesley, 1997). 

3A. D. Linde, Phys. Lett. B 108, 389 (1982); 114, 431 (1982); Phys. Rev. Lett. 48, 335 (1982). 

4A. Albrecht and P. Steinhardt, Phys. Rev. Lett. 48, 1220 (1982). 


201 


4 Inflation 


theories of new inflation was a nearly exponential expansion during the 
slow roll of one or more scalar fields down a potential hill, which is the 
main subject of Section 4.2. This provided a basis for “chaotic inflation” and 
“eternal inflation” and other variants, some of which are briefly described 
in Section 4.3. 

So far, the details of inflation are unknown, and the whole idea of infla- 
tion remains a speculation, though one that is increasingly plausible. Aside 
from the classic problems that inflation solved at the beginning, it has had 
one significant experimental success: a prediction of some of the proper- 
ties of the fluctuations in the cosmic microwave background and large scale 
structure. We will come to this in Chapter 10, after we take up the evolution 
of fluctuations in the early universe in Chapters 5 and 6 and the observation 
of these fluctuations in Chapters 7 and 8 and Section 9.5. 


4.1 Three puzzles 


In this section we will outline three classic cosmological problems, and work 
out the extent of the inflation required to solve each of them. For this 
purpose we will here simply assume that the universe went through an early 
period of exponential expansion, without worrying yet about how this came 
about. 


A. Flatness 


As we saw in Section 1.6 and 1.8, the observed Type Ia supernova redshift- 
distance relation and measurements of the ages of the oldest stars are 
consistent with a vanishing spatial curvature parameter Qx, though a non- 
vanishing curvature can be accommodated by changing Qy. Including 
data from the cosmic microwave background temperature fluctuations, dis- 
cussed in Section 7.2, favors Qx = 0. Although there is still room for a small 
non-zero Qg, it seems quite safe to conclude from these observations that 
|Qx| < 1. But Qg is just the present value of the dimensionless time depen- 
dent curvature parameter — K /a7H* = —K /å4?, with K constant. From the 
time the temperature dropped to about 104 K until near the present, a(t) has 
been increasing as ¢7/3, so |K|/å? has also been increasing as f7/3 œ T~!. 
Thus, if |Qx| < 1, then at 104 K the curvature parameter |K|/a? could not 
have been greater than about 1074. Earlier, a(t) was increasing as t!/?, so 
|K|/a? was increasing as t x T~*. In order for |K|/a? at 10+ K to be no 
greater than about 1074, it is necessary that |K|/a? was at most about 10716 
at the temperature T ~ 10!° K of electron-positron annihilation (roughly, 
the beginning of the period of neutron—proton conversion that results in 
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the observed helium abundance), and even smaller at earlier times.’ This 
is not a paradox—there is no reason why the curvature should not have 
been very small—but it is the sort of thing physicists would like to explain 
if we can. 

What Guth realized was that during inflation @/a would have been 
roughly constant, so |K|/a*H? would have been decreasing more or less 
like a~*. So to understand why space was so flat at the beginning of 
the present big bang it is not necessary to make any arbitrary assump- 
tions; if the radiation—dominated big bang was preceded by a sufficient 
period of inflation, it would necessarily have started with negligible 
curvature. 

To put this quantitatively, suppose the universe began with a period of 
inflation during which a(t) increased by some large factor e™, followed by 
a period of radiation dominance lasting until the time of radiation—matter 
equality, followed in turn by a period of matter dominance and then a period 
dominated by vacuum energy. If |K|/a? H? had a value of order unity at the 
beginning of inflation, then at the time ty of the end of inflation |K|/a? H? 
would have had a value |K| /ayH 7 of order e™?™ (where a7 and H; are the 
Robertson—Walker scale factor and expansion rate at this time), and today 
we will have 


IKI — oy (ari \? 
Dee See E (4.1.1) 
ao H aoHo 


Thus the flatness problem is avoided if the expansion during inflation has 
the lower bound 


aH 
NaS 
ag Ho 


(4.1.2) 


To evaluate this we will make the somewhat risky assumption that not 
much happens to the cosmic scale factor and expansion rate from the end 
of inflation to the beginning of the radiation-dominated era, so that 


aj Hy = a, ; (4.1.3) 


the subscript 1 denoting the beginning of the radiation-dominated era. 
We can express the expansion ao/a, of the universe since the start of the 
radiation-dominated era in terms of ratios of expansion rates by noting that 
over the whole of the radiation and matter-dominated era, the expansion 


SR. H. Dicke and P. J. E. Peebles, in General Relativity — An Einstein Centenary Survey, eds. S. 
Hawking and W. Israel (Cambridge University Press, 1979). 
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rate was 


H 3 4 
ga a eN; 
v2 a 
where ago = aoQr/Qu and Hgo = J22u Ho(a0/aEQ)* are the scale 
factor and expansion rate at matter—radiation equality. Setting a =a, & 
apo gives 


H 2 
m=- (2) (4.1.4) 
2 \a 


Using this relation to eliminate a1, we can put the bound (4.1.2) in the more 
useful form 


Ns (22) Hi al az e — [a] 
Ho Doc L) 0.037 h eV ; 


(4.1.5) 


where p; is the energy density at the beginning of the radiation-dominated 
era, and / crit = [3.00 x 1073 eV]4}? is the critical density (1.5.28). 

To go further, we need some idea of the energy density at the end of 
inflation. The success of the theory of cosmological nucleosynthesis shows 
that pı cannot be less than the energy density at the time of the beginning 
of neutron—proton conversion, roughly [1 MeV]‘, in which case Eq. (4.1.5) 
with h = 0.7 requires that the universe expanded during inflation by at least 
a factor 4 x 10’, or 17 e-foldings. At the other extreme, we would not expect 
p1 to be greater than the Planck energy density G7? = [1.22 x 101° GeV]*, 
in which case Eq. (4.1.5) with A = 0.7 would require that the expansion 
during inflation was at least by a factor 5 x 10’, or 68 e-foldings. We 
will see some evidence in Section 10.3 that p1 is of order [2 x 10!® GeV]*, 
in which case e™ for h = 0.7 would have to be at least 8 x 102°, so that 
N > 62. 

This is the least convincing of the arguments for inflation, because the 
small value of |K|/d? in the past could be explained by the assumption (one 
that was often made before anyone heard of inflation) that space is precisely 
flat, so that K = 0 now and always. On the other hand, as we will discuss in 
Section 4.3, inflation opens up the interesting possibility that the universe 
in the large is not at all homogeneous and isotropic, and that its apparent 
flatness of the cosmic metric is just the result of inflation. 

It would be quite a coincidence if inflation lasted for precisely the right 
number of e-foldings so that |K|/a? would have decreased during infla- 
tion from an initial value of order unity just enough so that its subse- 
quent increase during the radiation and matter-dominated eras would have 
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brought it back to a value of order unity now. It seems more likely that 
the present value |Qx| would be either much larger or much smaller than 
unity, and since observations tell us that it is not much larger than unity, 
inflationary theories suggest that |Qx| « 1. (But this sort of reasoning 
would also suggest that the vacuum energy is much less than the present 
matter density, which we now know is not the case.) 


B. Horizons 


From the beginning the observed high degree of isotropy of the cosmic 
microwave radiation background posed a problem. Recall that the hori- 
zon size in a matter- or radiation-dominated universe is of order t, which, 
because a(t) has increased as 17/ since the time of last scattering, was of 
order dy © Hy (1+zz)~/? at the time of last scattering. (See Eq. (2.6.32).) 
Also, according to Eq. (2.6.29), the angular diameter distance d4 to the sur- 
face of last scattering is of order Hp l (1 +zz)7!, so the horizon at the time 
of last scattering now subtends an angle of order dy/d4 ~ (1 + zr)? 
radians, which for zz œ 1100 is about 1.6°. Therefore in a matter- or 
radiation-dominated universe no physical influence could have smoothed 
out initial inhomogeneities and brought points at a redshift zz that are sep- 
arated by more than a few degrees to the same temperature, in contradic- 
tion with the nearly perfect isotropy of the microwave background at large 
angular scales observed ever since the background radiation was discovered. 
Inflation provides an explanation: during the inflationary era the part of 
the universe that we can observe would have occupied a tiny space, and 
there would have been plenty of time for everything in this space to be 
homogenized. 

To work out what this means for the expansion during inflation, first 
recall that as discussed in Section 1.13, the proper horizon size at the time 
ty of last scattering is 


tL dt 


roe (4.1.6) 


dy (tL) = atn) | 

bx 
with t, (possibly equal to —oo) the beginning of the era of inflation. We 
have seen that the contribution to the integral from the radiation and matter- 
dominated eras is much too small to account for the isotropy of the 
microwave radiation background, so we will assume that the integral is 
dominated by an era of inflation. For definiteness, we assume that during 
inflation a(t) increased exponentially at a rate H7, so that 


a(t) = a(t.) exp (Hrt — te) = arexp (= Hr(tr = D) , 
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where ty is again the time of the end of inflation, and ay = a(t;). With 
N = H(t; — t,) the number of e-foldings of expansion during inflation, 
Eq. (4.1.6) gives 


a(tz) [ev - 1] l 


dy (tL) = H, 


(4.1.7) 


In order to have any hope of solving the horizon problem, we must have 
eN œ 1, so we can drop the term —1 in square brackets in Eq. (4.1.7). 

To account for the observed high degree of isotropy of the cosmic 
microwave background at large angular scales we need dy (tL) > d4(tL), 
where d4 (tL) is the angular—diameter distance of the surface of last scatter- 
ing. According to Eq. (2.6.29), 


dalt) © ae 3 


(4.1.8) 
The condition dy (tL) > d4(tL) for the isotropy of the cosmic microwave 
background is then 


aH 
eN o I 


; 4.1.9 
ni, (4.1.9) 


This is the same as the condition (4.1.2) for the solution of the flatness 
problem. If we again make the assumption that not much happens between 
the end of inflation and the beginning of the radiation-dominated era, and 
use Eq. (4.1.3), then to solve the horizon problem we again need N > 17 if 
pı > [1 MeV]*, N > 62 if pı ~ [2 x 10!° GeV]*, and N > 68 if o1 ~ G~?. 
We will see in Chapter 10 that whether 17 or 62 or 68 e-foldings are needed to 
solve the horizon problem, it is only that number of e-foldings before the end 
of inflation that can be explored through observations of nonuniformities 
in the present universe. 

It should be noted that the time tz of last scattering does not enter in the 
bound (4.1.9) on M, so this is also the condition that the whole sky at any 
redshift z < zgo was within the horizon at the time that light observed now 
with that redshift left its source. Indeed, Eq. (4.1.9) is also the condition 
that the horizon size at the present should be greater than the size of the 
observable universe, which is roughly 1/Ho. 


C. Monopoles 
In grand unified theories local symmetry under some simple symmetry 


group is spontaneously broken at an energy M ~ 10!6 GeV to the gauge 
symmetry of the Standard Model under the group SU(3) x SU(2) x U(1). 
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In all such cases, the scalar fields that break the symmetry can be left in 
twisted configurations that carry non-zero magnetic charge and that cannot 
be smoothed out through any continuous processes. This poses a prob- 
lem for some cosmological models.’ The scalar fields before this phase 
transition would have necessarily been uncorrelated at distances larger than 
the horizon distance, the farthest distance that light could have traveled 
since the initial singularity. At an early time ¢ in the standard big bang 
theory the horizon distance was of order t ~ (G(kgT)*)~!/? (where G ~ 
(10!9 GeV)~? is Newton’s constant), so the number density of monopoles 
produced at the time that the temperature drops to M /kg would have been 
of order 17? ~ (GM*)?/2, which is smaller than the photon density ~ M? at 
T ~ M /kg by a factor of order (GM~)?/?. For M ~ 10!° GeV this factor is 
of order 107°. If monopoles did not find each other to annihilate, then this 
ratio would remain roughly constant to the present, but with at least 10° 
microwave background photons per nucleon today, this would give at least 
one monopole per nucleon, in gross disagreement with what is observed. 
This potential paradox was one of the factors leading to interest in 
inflationary cosmological models. In such models, a period of exponential 
expansion that occurred before the monopoles were produced would have 
greatly extended the horizon, and an exponential expansion that occurred 
after the production of monopoles (but before photons were created in a 
period of reheating) would have greatly reduced the monopole to photon 
ratio. To be specific, the search for monopoles in iron ore, seawater, etc. 
shows that there are fewer than 10~° per gram, or about 1073? monopoles 
per nucleon, and hence fewer than about 1073? monopoles per photon.® 
(With this abundance, even if the monopole mass were as large as 10!° GeV, 
they would make a negligible contribution to the cosmic mass density.) In 
order for inflation to have reduced the monopole/photon ratio by a fac- 
tor 10730, it must have increased the horizon size (at some time before the 
reheating that creates photons) by a factor 10!°. That is, the horizon size 
eN /H; after inflation must be greater than the previous estimate (GM*)~!/2 
by at least a factor 10!°. For Hı ~ (GM*)!/? this requires the number M of 
e-foldings to be greater than In 10! = 23. Of course, another possible solu- 
tion of the monopole problem is that inflation ends at a temperature below 
the grand unification scale M, so that there never was a time when the grand 
unification group was unbroken. An even simpler possibility, which does 
not rely on inflation, is that there may be no simple gauge group that is 


6Fora discussion, see QTF, Vol. II, Sec. 23.3. 

TYa. B. Zel'dovich and M. Yu. Khlopov, Phys. Lett. B79, 239 (1978); J. Preskill, Phys. Rev. Lett. 
43, 1365 (1979). For a review, see J. Preskill, Annual Rev. Nucl. Part. Science 34, 461 (1984). 

8For a review, see Particle Data Group, Phys Lett. B 582, 1001 (2004). 
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spontaneously broken to the gauge group SU(3) x SU(2) x U(1) of the 
Standard Model. 


The most serious of the above three problems is the horizon problem. As 
we have seen, there are possible solutions of the flatness and monopole 
problems that do not rely on inflation. Also, any number of e-foldings 
of inflation that solves the horizon problem automatically solves not only 
the flatness problem, but also the monopole problem. If the radiation- 
dominated era begins with an energy density p; > [10!° GeV]* then for 
inflation to solve the flatness and horizon problems we need at least 59 
e-foldings of inflation, which is more than enough to avoid the monopole 
problem, while if o1 < [10!° GeV]* GeV then in the usual picture of grand 
unification there would be no monopoles at all. 


4.2 Slow-roll inflation 


In Guth’s original work, inflation was conceived to be due to a delayed 
first-order phase transition, in which a scalar field was initially trapped in 
a local minimum of some potential, and then leaked through the poten- 
tial barrier and rolled toward a true minimum of the potential. It was 
soon realized! that this idea does not work, because of what has come 
to be called the graceful exit problem. The transition from the super- 
cooled initial “false vacuum” phase to the lower energy “true vacuum” 
phase could not have occurred everywhere simultaneously, but here and 
there in small bubbles of true vacuum, which rapidly expanded into the 
background of false vacuum, in which the scalar field would have been 
still trapped in its local minimum, like water droplets forming in super- 
cooled water vapor. The trouble is that the latent heat released in the phase 
transition would have wound up in the bubble walls, leaving the interi- 
ors of the bubbles essentially empty, so that the only places where there 
would be energy that could grow into the present contents of the universe 
would be highly inhomogeneous and anisotropic. At first Guth thought 
the bubbles in inflationary cosmologies would have merged, leading to our 
present more-or-less homogeneous universe, but this could not have hap- 
pened; because the background false-vacuum space continued to inflate, 
the bubble walls would have moved too fast away from each other ever to 
have coalesced. 


Is W. Hawking, I. G. Moss, and J. M. Stewart, Phys. Rev. D 26, 2681 (1982); A. H. Guth and E. J. 
Weinberg, Nucl. Phys. B 212, 321 (1983). 

2For a description of this process and references to the original literature on bubble formation in 
quantum field theory, see QTF II, Section 23.8. 
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Guth’s version of inflation was soon supplanted by a version due to Linde 
and to Albrecht and Steinhardt, known as ‘new inflation.’ Originally new 
inflation was formulated in a particular model of the breakdown of a grand 
unified symmetry, using a symmetry-breaking mechanism introduced by 
Coleman and E. Weinberg.* With this mechanism the zero-temperature 
potential for a scalar field ¢ is artificially adjusted to have zero second 
derivative at ¢ = 0. One-loop radiative corrections then give a poten- 
tial equal to a known positive factor times y*In(y/M), where M is a free 
constant; changing the value of M amounts to changing the 4 coupling 
constant. This potential has an unstable stationary point at g = 0 anda 
minumum at go = Me~'/4. At finite temperature T there is also a quadratic 
term in the potential, proportional to T*y*, which makes the stationary 
point at ọ = 0 into a local minimum. Again the phase transition occurs by 
forming bubbles, but for low temperature the potential barrier is very small, 
and so the scalar field in the interior of the bubble starts with g nearly zero. 
The field then rolls slowly down the potential, in the manner discussed 
in Section 1.12, while the universe (including the bubble) undergoes an 
exponential expansion. Eventually the field energy is converted into 
ordinary particles, filling the bubble. Our observable universe is supposed to 
occupy a small part of one such bubble. 

The consequences of the new inflationary theories turned out to depend 
on the slow roll of the scalar field after bubble formation, rather than the 
process of bubble formation itself. Indeed, the important aspects of inf- 
lation do not really require any assumptions about grand unification or the 
Coleman—E. Weinberg mechanism. All we need to assume is that there is a 
scalar field g, known as the inflaton, which at some early time takes a value at 
which the potential V (ọ) is large but quite flat. The scalar field “rolls” very 
slowly at first down this potential, so that the Hubble constant decreases only 
slowly, and the universe experiences a more-or-less exponential inflation 
before the field changes very much. 

To put this quantitatively, recall that the energy density (B.66) and 
pressure (B.67) of a spatially homogeneous scalar field g(t) with potential 
V (gy) in a Robertson—Walker spacetime take the form (with A = c = 1) 


l> eee 
p= e + Vip) s es = V0), 


so the energy conservation equation ò = —3H(p + p) takes the form: 


$+3H¢o+V'(g) =0, (4.2.1) 


3A. D. Linde, Phys. Lett. B 108, 389 (1982); 114, 431 (1982); Phys. Rev. Lett. 48, 335 (1982); 
A. Albrecht and P. Steinhardt, Phys. Rev. Lett. 48, 1220 (1982). 
45S. Coleman and E. Weinberg, Phys. Rev. D 7, 1888 (1973). 
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where H = d/a is the time dependent expansion rate, which during the 
period of scalar field energy dominance is given by 


_ j8Gp  j&rG/l1., 
H=; 3 -J 3 (56 +). (4.2.2) 


From Eqs. (4.2.1) and (4.2.2) we can derive an extremely useful formula 
for H. By taking the time derivative of the square of Eq. (4.2.2) and then 
using Eq. (4.2.1), we have 


_  8nG 
2HH = = (ë + V'(@)¢) = 8x GH", 


and therefore 
H =—4nG¢@". (4.2.3) 


Now, in order to have a nearly exponential expansion, the fractional 
change |H/H|(1/H) in H during an expansion time 1/H must be much 
less than unity. That is, we must have 


|H|<H’. (4.2.4) 
With Eqs. (4.2.3) and (4.2.2), this requires that 
H KIV). (4.2.5) 


This has the consequence that p ~ —p, and also 


H~ —— l (4.2.6) 


Usually it is also assumed that the fractional change |¢/g|(1/H) in @ 
during an expansion time 1/H is much less than unity. That is, 


\d| KX Hlọl. (4.2.7) 


This has the consequence that we may drop the inertial term ¢ in Eq. (4.2.1), 
which then becomes 

_V@__ vø aso 
3H 24n GV (Y) A 


The fractional change of the expansion rate H in an expansion time 1/H 
will then be 


H 1/3 |V@~e|_ 1 Vor 
H? 2 a m ~ arle) ; ee) 
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so the exponential expansion of the universe will last for many e-foldings if 


ce K V16nG . (4.2.10) 
Vg) 


According to Eq. (4.2.8), the condition on the potential for the inequality 
(4.2.5) to be satisfied is that 


oe Ol k VAG, (4.2.11) 
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which is guaranteed by the inequality (4.2.10). Also, Eq. (4.2.8) gives 
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(4.2.12) 


The inequality (4.2.10) ensures that the absolute value of the last term on 
the right-hand side is much less than |V’(@)|, so the condition for |¢| to be 
much less than |V’(g)| is that |V” (p)| « 9H, or, in other words, 


| v"@) 
Vy) 


Eqs. (4.2.10) and (4.2.13) are the two “flatness” conditions needed to insure 
the slow roll of both g and @. It is possible in principle that the second flat- 
ness condition (4.2.13) may not be satisfied for a potential that does satisfy 
the first flatness condition (4.2.10), but this is unusual, and in particular is 
not possible for the simple potentials discussed below. 

Under these conditions the expansion is generally not strictly exponen- 
tial, but it can easily be exponentially large. Suppose that during some time 
interval the field y(t) shifts from an initial value g; to a final value g2, with 
0 < V(g2) < V(¢1), with both inequalities (4.2.10) and (4.2.13) assumed 
valid over this range of y. The Robertson—Walker scale factor will increase 
during this period by a factor 


t 
alt) _ a f 2 nad Ss i nae) 
a(tı) ti 91 p 


92 
~ exp |- / (=) do| : (4.2.14) 
wN PE 
In this range the potential is positive and decreases as g(t) runs from g to 
92, so the argument of the exponential in Eq. (4.2.14) is positive. Condition 
(4.2.10) tells us that this argument is much greater than /427 G|g) — ¢2|, so 
this flatness condition guarantees that we get a large number of e-foldings 


K24nG . (4.2.13) 
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in any time interval in which gy changes by an amount at least as large as 
1/42 G = 3.4 x 10!8 GeV. 

It is important to note that such large values of the scalar field do not 
necessarily rule out the classical treatment of gravitation on which we have 
been relying. The condition that allows us to neglect quantum gravitational 
effects is that the energy density should be much less than the Planck energy 
density: 


IVO] K (42G)~ . (4.2.15) 


This condition can be satisfied, even if y is comparable to the Planck mass 
~ G12, by supposing that V(g) is proportional to a sufficiently small 
coupling constant. Neither the flatness conditions (4.2.10), (4.2.13) nor the 
growth (4.2.14) of a(t) for a given change in g(t) depend on the value of 
such a coupling constant. 

Depending on the potential shape, the flatness conditions (4.2.10) and 
(4.2.13) may provide either conditions on the initial value of the scalar field 
or on the parameters of the potential itself. As an example of the first sort, 
consider the power-law potential 


V) =g", (4.2.16) 


with g and « arbitrary real parameters, except that we assume that g > 0 and 
take |æ| larger but not orders of magnitude larger than unity. The flatness 
conditions (4.2.10) and (4.2.13) are then both satisfied for |g| >> 1//W47G, 
irrespective of the value of the coupling constant g. The number of e- 
foldings of expansion for a scalar field that starts at a value gı and ends at 
a much smaller value is given by Eq. (4.2.14) as 87 Go; /a, so for instance 
for a = 4 we get the 62 e-foldings needed to avoid the horizon problem 
for inflation ending at a temperature 2 x 10! GeV/kg if |y1| > /31/aG. 
On the other hand, for this potential the condition (4.2.15) for the neglect 
of quantum gravitational effects does put an upper bound on |g|. For 
instance, in the case a = 4, the potential (4.2.16) will satisfy condition 
(4.2.15) if g « (V42G|y|)~* so, with |ọ| just large enough to get 62 e- 
foldings of exponential expansion, to avoid quantum gravity corrections we 
need g < 2 x 1075. The need for a very small coupling can be avoided in 
theories with more than one scalar field, such as “hybrid inflation” theories,’ 
in which the effective self-coupling of one scalar field is very small because 
the other scalar field has a very small mass and hence a small expectation 
value. 


5A. D. Linde, Phys. Lett. B 259, 38 (1991); Phys. Rev. D 49, 748 (1994) [astro-ph/9307002]. 
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There is a well-known example of a rather different sort for which 
Eqs. (4.2.1) and (4.2.2) can be solved exactly, the exponential potential:° 


V (p) = gexp(—Ag) , (4.2.17) 
with g and à arbitrary constants. It is easy to verify that Eqs. (4.2.1) and 
(4.2.2) are satisfied by’ 

1 (82Gge7?? 
poy in == )'5 (4.2.18) 
À 3—e€ 
and 
axt, H=l/et, (4.2.19) 
where € is the positive dimensionless quantity 
2 
= ; 4.2.20 
«= 16nG ee 


The flatness conditions (4.2.10) and (4.2.13) here respectively read € « 1 
and e < 3/2, so both are satisfied if and only if 


exl, (4.2.21) 


with no constraint on the values of either g or gy. This exact solution is useful 
as a check of approximate calculations for more general potentials. For 
instance, Eq. (4.2.18) may be compared with the solution of the approximate 
equation (4.2.8), which for the exponential potential is 


l 82 Gge?t? 
oon = Lin( 3 ). 


The difference between this and the exact solution (4.2.18) is evidently neg- 
ligible for € « 1. Likewise, the increase in the Robertson—Walker scale 
factor during a time interval from f; to t2 in which the field drops from ¢ 
to g2 Is 


(4.2.22) 


1 
a(t) _ (2) = eona (4.2.23) 
a(tı) ty 


6L. F. Abbott and M. B. Wise, Nucl. Phys. B 244, 541 (1984); F. Lucchin and S. Matarrese, 
Phys. Rev. D 32, 1316 (1985); Phys. Lett. B 164, 282 (1985); D. H. Lyth and E. D. Stewart, Phys. Lett. 
B 274, 168 (1992). 

7This is an exact solution for all values of z, but it is not the most general solution. Since Eqs. (4.2.1) 
is a second-order differential equation, it has a two-parameter set of solutions. The particular solution 
(4.2.18) is an attractor, which the general solutions approach for large t. 
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which is exactly the same as the “slow-roll” result (4.2.14) for the exponential 
potential. 

It is assumed in these theories that when V (gy) dropped sufficiently far, 
the inequalities (4.2.10) and (4.2.13) were in general no longer be satisfied, 
and ọ began a damped oscillation around the minimum of V(¢), which is 
at the present value gp. Eventually y would have approached close enough 
to go so that we can approximate the potential as a quadratic, 


1 
Vig) = z (y — go)” . (4.2.24) 


(In order to account for the small present value of the vacuum energy, it 
is necessary to assume that, for reasons that remain entirely mysterious, 
the minimum value of the potential is very close to zero.) This is just like 
the field theory of spinless particles with mass m and negligible velocity. 
In order to have ended inflation, there must also be some coupling of the 
inflaton scalar field to other fields, including the fields of ordinary matter 
and radiation, so that the energy density of the inflaton field decreased as 


a(t) \> 
po(t) = poltr) (2) eet, (4.2.25) 
where I is the rate of decay of the g quanta into other particles, and ty is 
taken at the beginning of the inflaton oscillation and decay. This is known 
as the period of reheating.® It is this period in which the entropy observed 
in the present universe is supposed to be generated.? 
The energy density py of the particles into which g decayed satisfies a 
conservation equation like Eq. (1.1.32), but corrected to take account of the 
flow of energy from the inflaton: 


bm +3H(om +pm) =T% . (4.2.26) 


For definiteness, we will assume that the decay products of the inflaton are 
highly relativistic, so that pm = pm /3. Then the solution of Eq. (4.2.26) is 
PDT ÈC) f! 

at (t) tī 


In contemporary models of inflation this is the source of all the matter 
and radiation in the present universe. (In using this relation it is important 


pm (t) = a(t) e TED dy! (4.2.27) 


8The term reheating is a historical relic of early theories of inflation in which it was assumed that the 
zero-temperature slow roll of the inflaton field followed an earlier period of high temperature. 

9A. D. Dolgov and A. D. Linde, Phys. Lett. 116B, 329 (1982); L. F. Abbott, E. Farhi, and M. B. 
Wise, Phys. Lett. 117B, 29 (1982). 
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to keep in mind the possibility of parameteric amplification of the density 
Po(tr). That is, scalar field expectation values can increase the effective mass 
of the inflaton field ọ, and thereby increase its energy density. !°) 

The matter energy density (4.2.27) starts equal to zero at t = tr, then 
rises at first, and finally falls as the density is attenuated by the expansion of 
the universe. It is of some interest to find the value of p m (t) at its maximum, 
because this tells us the maximum temperature ever reached, which controls 
the kinds of relics—cold dark matter, baryons, monopoles, axions—left over 
from the early universe. This maximum density can be easily calculated in 
two extreme cases, for T >> H (tr) and T < A (ty). 

For TP > H (tr), we can express Eq. (4.2.27) as a power series in H (tr)/ T 
by repeated integration by parts 


S a(ty)\* (, H(t 
puto) = pet (2) (1+ ' +...) (4.2.28) 


We see that in this case om jumped up almost immediately to the value 
Po(tr), and then decreased with the usual a“ factor, so in this case all the 
energy of the inflaton field at the end of inflation went into ordinary matter 
and radiation. This was the assumption made in deriving lower bounds on 
the number of e-foldings of inflation in the previous section. 

For r « A(t), the maximum value of p(t) was reached at a time 
when the exponential eT- in Eq. (4.2.27) had not yet begun to decay. 
(This will be checked below.) Setting this factor equal to unity, we have 


py(tr)Ta(tr) f! 
at (t) fr 


At this time the energy density of the universe was still dominated by the 
inflaton, so a(t) = a(tr)(t/tr)?’, and Eq. (4.2.29) becomes 


3 8/3 5/3 
pu(t) = ZP tr polir) (=) (£) 5 ) (4.2.30) 
t tī 


This reached a maximum at ¢ = (8/3)3/>t;, which incidentally confirms 
that, under the assumption that T « H (tr) = 2/3tr, the argument of the 
exponential in Eq. (4.2.27) at this maximum was still negligible. At this 
maximum, the matter density is 


pu (t) = a(t’) dt’ (4.2.29) 


PM max = (3/8) ^T tr pp (tr) = 0.139 (r/R) polt). (4.2.31) 


10L, Kofman, A. D. Linde, and A. A. Starobinsky, Phys. Rev. Lett. 73, 3195 (1994); Phys. Rev. D 
56, 3258 (1997). 
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In this case the maximum energy density at the beginning of the radiation- 
dominated era would have been much less than the energy density in the 
inflaton field at the end of inflation. 


4.3 Chaotic inflation, eternal inflation 


It was soon realized that some “new inflation” models actually entail an 
endless production of inflating bubbles.! This has come to be called “eternal 
inflation.”? To take one example, if the inflaton scalar field at a given point in 
space once had a value of unstable equilibrium (like at the top of a potential 
hill) then the probability that the inflaton field was still at this value after 
a time ¢ decreased as exp(—yt). However, the volume in which the scalar 
field had this value was meanwhile increasing as exp(+3H1), so as long as 
3H > y the volume of space that still undergoes inflation eternally increases 
exponentially. 

We have been assuming that the scalar field is initially independent 
of position, aside from small perturbations, about which more in 
Chapter 10. Soon after the introduction of new inflation, the possibilities 
of inflationary theory were greatly expanded and improved when Linde? 
proposed the theory of “chaotic inflation,” in which initially one or more 
scalar fields varied in a random way with position. Here and there one 
would have found patches of space in which an inflaton field took a nearly 
uniform value at which the potential satisfied the slow-roll conditions 
(4.2.10) and (4.2.13), as for instance a value substantially greater than 
the Planck mass for a power-law potential. Inflation will then have 
occured in such a patch, provided the patch was initially sufficiently 
large. 

It is necessary to require that the uniform patch be sufficiently large, 
because the scalar field Lagrangian density given by Eq. (B.63) contains 
a term involving spatial derivatives, which for a non-uniform scalar field 
contributes a term —a~*V7¢ on the left-hand side of Eq. (4.2.1). In order 
for this term not to interfere with the slow-roll analysis of the previous 
section, we need the scale L of proper distances over which ¢ takes a roughly 
constant initial value to be greater than |y/V'(y)|'/*. For instance, for the 


'P J. Steinhardt, in The Very Early Universe — Proceedings of the Nuffield Workshop, 1982, eds. 
G. W. Gibbon S. W. Hawking, and S. T. C. Siklos (Cambridge University Press, 1983): 251; A. Vilenkin, 
Phys. Rev. D 27, 2848 (1983). 

2For reviews of this and other variants of inflation, see A. Guth, talk given at the Pritzker Symposium 
on the Status of Inflationary Cosmology, January 1999, astro-ph/0002188; A. Linde, J. Phys. Conf. Ser. 
24, 151 (2005) [hep-th/0503195]. 

3A.D. Linde, Phys. Lett. 129B, 177 (1983). 
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potential V (p) = gq’, this condition gives 


l l a 
LP aaa = (=o) COK (4.3.1) 


The classical field theory condition (4.2.15) makes the first factor much 
larger than unity, while the slow-roll condition (4.2.10) makes the second 
factor also much larger than unity, so the patch size must be very much 
greater than the third factor, which is essentially the Planck length. Such 
relatively large uniform patches may be quite rare, but that is no argument 
against this hypothesis, because life can only arise in big bangs that stem 
from such patches. 

A sufficiently large patch will inflate to an enormous size, which to 
observers deep inside seems highly homogeneous and isotropic. In this 
way, chaotic inflation solves a puzzle that had not generally been realized to 
be a puzzle, even when the first inflation theories were being developed. It 
explains not just why the Robertson—Walker metric in which we find our- 
selves was remarkably flat in the past; it also explains why we find ourselves 
in a Robertson—Walker metric at all. Unfortunately, it is hard to see how 
we will ever observe any part of the universe beyond our inflated patch. 
The validity of the idea of chaotic inflation will probably have to come from 
progress in fundamental physics, which may verify the existence of a suitable 
inflaton field, rather than from astronomical observation. 

Even when the scalar field in a patch of the space was large enough to 
start a slow roll inflation, quantum fluctuations in smaller regions within 
that patch would subsequently have driven the inflaton field to even higher 
values, so that these regions will begin an earlier stage of inflation.’ In this 
way, chaotic inflation turns out also to be eternal. 


4A. D. Linde, Mod. Phys. Lett. A1, 81 (1986); Phys. Lett. B 175, 395 (1986); A. S. Goncharev, 
A. D. Linde, and V. F. Mukhanov, Int. J. Mod. Phys. A 2, 561 (1987); A. D. Linde, D. A. Linde, and 
A. Mezhlumian, Phys. Rev. D 49, 1783 (1994) [gr-qc/9306035]. 
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General Theory of Small Fluctuations 


In most of the work described in the previous chapters, the universe has been 
treated as isotropic and homogeneous, with a gravitational field described 
by the Robertson—Walker metric. This is of course just an approximation, 
which ignores many of the most interesting things in the universe: galaxy 
clusters, galaxies, stars, us. We now turn to an analysis of these departures 
from homogeneity and isotropy. 

In this chapter we will lay a foundation for this analysis, by deriv- 
ing the general relativistic equations that govern small fluctuations, and 
drawing general conclusions about their implications. Chapter 6 will apply 
this formalism to the evolution of structure, from the radiation-dominated 
era to near the present. In Chapters 7 and 8 we apply the results obtained 
in Chapter 6 to the observed fluctuations in the cosmic microwave back- 
ground and to the growth of structure. Chapter 9 deals with gravitational 
lensing, which may in the long run provide the best tool for analyzing the 
large scale structure of dark matter. These chapters are kept at a general 
level, independent of detailed assumptions about an inflationary era before 
the radiation-dominated era. Chapter 10 will then explore the implications 
of inflationary theories for the calculations of Chapters 6 through 9. Some 
readers may prefer to skip on immediately to Chapter 6, using the present 
chapter as a source of useful formulas, while others will do better to read 
these chapters in order. 


5.1 Field equations 


As an essential feature of the analysis presented here, we assume that during 
most of the history of the universe all departures from homogeneity and 
isotropy have been small, so that they can be treated as first-order pertur- 
bations.! Because the observable universe is nearly homogeneous, and its 
spatial curvature either vanishes or is negligible until very near the present, 
we will take the unperturbed metric to have the Robertson—Walker form 
(1.1.11), with curvature constant K = 0. (Effects of a possible finite curva- 
ture at times close to the present will be included in Chapters 7 through 9 


The study of first-order cosmological fluctuations was initiated by E. Lifshitz, J Phys. U.S.S.R. 
10, 116 (1946). Classical second-order corrections were worked out by K. Tomita, Prog. Theor. Phys. 
37, 831 (1967); 45, 1747 (1970); 47, 416 (1971). For recent work, see K. Tomita, Phys. Rev. D 71, 
3504 (2005); N. Bartolo, S. Matarrese, and A. Riotto, J. Cosm. & Astropart. Phys. 0606, 024 (2006) 
[astro-ph/0604416]. 
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where they are relevant.) The total perturbed metric is then 
Suv = 8w thw , (5.1.1) 
where g,» is the unperturbed K = 0 Robertson—Walker metric 
Zo0=-1, B0=%i=0, Fy =a OS,, (5.1.2) 


and uv = Ay, is a small perturbation. (Here and from now on, a bar 
over any quantity denotes its unperturbed value.) The perturbation to the 


inverse of a general matrix M is ôM -1 = —M7! (8M) M" !, so the inverse 
metric is perturbed by 
h”? = gi? — g” = _glPgreh , (5.1.3) 


with components 
hi =-a-*hy, bh =a*hi, h” = —hoo. (5.1.4) 


Note the — sign in the last expression in Eq. (5.1.3); in our notation, the 
perturbation ég"” to g”” isnot given by simply using the unperturbed metric 
to raise the indices on ôg „v. 

The metric perturbation produces a perturbation to the affine connection 


1_ z 
Sri = 58" [-2hp lS, + Onhov + OyApr = dohan] i (5.1.5) 


For K = 0, the only non-vanishing components of the unperturbed affine 
connection are given by Eqs. (1.1.17)-(1.1.19) as 

a -0 ; 

= g ; Tý = adôij í 

Thus Eq. (5.1.5) gives the components of the perturbed affine connection 
as 


5 Mk = 5 (—2ad hio die + khi; + Əh — dihjx) (5.1.6) 
êri = 5 (-Žr + hi + djhio — aho) (5.1.7) 
ôr} = ; (2ačı Sij hoo — ajhio — dihjo + hij) (5.1.8) 
sriy = 5 (2hio — ahoo) (5.1.9) 
y= “ho - T asho (5.1.10) 
ôro = -Zoo (5.1.11) 
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In particular, we will need 


1 1 
À 
ôT = ðu È zhi — zho : 
To write the Einstein equations, we need the perturbation to the Ricci tensor 
à 
3a OST 
He 3x“ ax* 


+ary iy, + ory, Py, —3r7, PY, — ary, 8, , (5.1.12) 


ôR 


with components 
1 Fs looo; 
bRik = — 5 8% khoo — (2a + ait) dixhoo — 54 Sik hoo 


1 
+55 (V2 hie — 9jhic — A:9kj + Bðxhü) 


Îi å /; , ar a 
= pet a (ix z Sieh) Pa (—2hik + ôjkhii) + 7 Sik ôiMo 


2 (ajheo + dehjo) s (5.1.13) 


1 ; ` 
t3 (ajho + arto) Tig 


1 2 a 2a? 
ôRo; = ô Rio = “hoo + F a) (v hjo — ðjðihio) — (: + = | hjo 


ie ee 
+55 È 5 (dh — omy) , (5.1.14) 
1 3a. l, 
5Roo = — VŽhoo + —hoo — — dihi 
00 = 573 Y "00 + zg 00 T Ga iio 
1 |, 24, & ü 
ES ji- “tn +2(5- z) m] ‘ (5.1.15) 
2a a a a 
In general, we can put the Einstein field equations (B.71) in the form 
Ruy = —82GS py , (5.1.16) 
where 
1 
Suv = Tuy = zur Br" Too ; (5.1.17) 


(A cosmological constant can be accommodated by including a term in Tyv 
proportional to g,,,, with a constant coefficient.) The perturbation to the 
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energy-momentum tensor and metric produces a perturbation to the source 
tensor Suv: 


1 Liase 
Suv = Tuv — zEuvôT z zuv T’ ; (5.1.18) 
We are not assuming that the contents of the universe form a perfect fluid, 
but the rotational and translational invariance of the unperturbed energy- 
momentum tensor TH” require that it takes the perfect fluid form 
(B.42): 


Tuv =P Euv + (P + A) ituity , (5.1.19) 


where p(t), p(t), and u” are the unperturbed energy density, pressure, and 
velocity four-vector, respectively, with 7° = 1 and i = 0. Also, we use 
the unperturbed Einstein equations (1.5.17) and (1.5.18) to write 6 and pin 
terms of the Robertson—Walker scale factor and its derivatives 


3 à? 1 24 «a 
0o = — | — |, p= -— |— +l. 5.1.20 
P= aG (=) P 8G ( a T 5) ( ) 


It follows in particular that the unperturbed energy-momentum tensor has 
the trace 


Thus Eq. (5.1.18) gives 


p] oe -2 
a j: 3 [a à 
Sik = Tik — gror at EG (5) hik , (5.1.21) 
6S;9 = êT ;jo + : Daal hj (5.1.22) 
=R Gla eJ T ee 
So = ôT, q lar + : a h (5.1.23) 
pa al eas yo Wen N m 


The Einstein equations (5.1.16) thus take the form 


2 
EE E A E T A + a) Bch 
jk 7 ik Xr 7 j Ok n00 ik 00 
1. : 1 2 
— ace dik hoo + = (v hig — dið;hik — diðkhij + djðkhi) 


2a? 
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l. af; ; à? 3a a 
ae si (ix = Sxl) + (3 + 3) hik + (=) Sik hii 
1 
+ “5 ix Oihig + = 5 (hro + ahjo) + = + a (Ə;hko + ðkhjo) » (5.1.24) 
ba å 2ä 
—82 GbTjo = “yhoo += z2 (v hjo — djðihio) + 2 + i hjo 


la 
+5 = lz 5 (ðh - am) , (5.1.25) 


2 2a? 2a 


1 . 2 ee -2 ee 
+ a fu - 2 — aa +2 (3 — z) m] +3 (3 + z) hoo. (5.1.26) 
a a a a 


The components of the energy momentum tensor are subject to the 
conservation condition that 7“,.,, = 0, which to first order in perturba- 
tions gives 


1 1 3a: | eee 
—821G (Tw + iT) = —-V*hoo + ad — 5 ðihio 
a 


a,8THy + Pot» — Th oT", + °) Bnew ta — ôr}, T”, =0, (5.1.27) 


in which the perturbations to the energy-momentum tensor ôT”, with 
mixed indices can be calculated from 


8T", = 8% Sie il 4 (5.1.28) 


Setting v equal to a spatial coordinate index j gives the equation of 
momentum conservation 


224 
A957? +3; T+ 87") aad T o— TE djhoo— tho) = 0. (5.1.29) 


while setting v equal to the time coordinate index 0 gives the equation of 
energy conservation 


898 To+0)5T ip + 8T 9) — ST; (Z) (hatha) = 0. (5.1.30) 
a a 2a? a 

As remarked in Appendix B, these conservation equations are not 
independent conditions, but may be derived from the Einstein field equa- 
tions. However, it is often convenient to use either or both in place of one or 
two of the field equations. Also, in the frequently encountered case where 
the constituents of the universe are non-interacting fluids (such as one fluid 
consisting of cold dark matter and another consisting of ordinary matter 
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and radiation) these conservation equations are satisfied separately by each 
fluid, information that could not be derived from the field equations. 

The results obtained so far are repulsively complicated. Fortunately, the 
spatial isotropy and homogeneity of the unperturbed metric and energy- 
momentum tensor allow us to simplify these results by decomposing the per- 
turbations into scalars, divergenceless vectors, and divergenceless traceless 
symmetric tensors, which are not coupled to each other by the field equa- 
tions or conservation equations. The perturbation to the metric can always 
be put in the form? 


hoo = —E , (5.1.31) 
F 
hg =a E + G| : (5.1.32) 


a hake PB IC) ƏC; 
y= a axia xI g 3x à ax! 


+ p (5.1.33) 


where the perturbations A, B, Ci, Dj = Dji, E, F, and G; are functions of 
x and ¢, satisfying the conditions 
aC; 0G; ƏD; 
= = 0, 70, Di=0. (5.1.34) 
ax! ax! ax! 
To carry out a similar decomposition of the energy-momentum tensor, 
we first note that for a perfect fluid we would have 


Tuv = P&uv + (P +P , (5.1.35) 
with 
g} upy = —1, (5.1.36) 
Recalling that u; = 0 and up = —1, we find that the normalization condition 
Eq. (5.1.36) gives 
8u? = ŝuo = hoo/2 , (5.1.37) 


while ôu; is an independent dynamical variable. (Note that ŝu” = ô(g”” up) 
is not given by g"’du,.) Then the first-order perturbation to the energy- 
momentum tensor for a perfect fluid is 


ôT = phy +.a°6j6p, 5Ti = phio — (6 + p)ôui, Too = — 5 hoo + dp. 
(5.1.38) 


2To see this, we define F, A, and B, as the solutions of VF =a lashio. V2A+V4B=a 23iðjhiks 
and 34 + V?B = a™?hii, then define C; as the solution of v?C; = a7 ajhjx — il A + V?B], and then 
use Eqs. (5.1.32) and (5.1.33) to define G; and D;k. 
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More generally, we can always put the perturbed energy-momentum tensor 
in a form like that of the perturbed metric. In general, we define 5p just 
as for a perfect fluid, as the difference between 579 and —phoo, but 5p is 
not necessarily given by varying the temperature and chemical potentials 
in the formula for p that applies in thermal equilibrium. Also, in general 
we define the velocity perturbation ôu; times p + p as for a perfect fluid, 
as the difference between —ôT;ọ and phjp, and we decompose ôu; into the 
gradient of a scalar velocity potential ŝu and a divergenceless vector bul. 
Finally, we define a?ôp as the coefficient of ôy in the difference between 
dT; and pd;, again without assuming that dp is given by varying the tem- 
perature and chemical potentials in the formula for p that applies in ther- 
mal equilibrium. The other terms in ôT, denoted 4;4;7°5, dir + an, 
and rf 


j » represent dissipative corrections to the inertia tensor. That is, we 
write 


5T yj = phy +a [syp + dids + din! + din + y | , (5.1.39) 


STs phy =p) (aisu P bul’) ; (5.1.40) 


ôToo = —P hoo + 60, (5.1.41) 


where a, my , and buy satisfy conditions analogous to the conditions 


(5.1.34) satisfied by C;, Dj, and G;: 
TỌ. (5.1.42) 


V ae V _ > ie 
Oj; = jôu; =0, OT jj =0, Tij 


To repeat, these formulas can be taken as a definition of the quantities ôp, 
ôp, and ôu; = djdu + bul, as well as of the anisotropic inertia terms 1°, 
z”, and x”, which characterize departures from the perfect fluid form of 
the energy-momentum tensor. The perturbed mixed components (5.1.28) 
of the energy-momentum tensor, which are needed in the conservation laws, 
now take the form 


8T’; = Syp + bið? + Bij) + am + 7G 


ST iy = a™?(5 + p)(ad;F + aG; — aj5u— uV), (5.1.43) 
8T’; = (6 +p)(Gdu+ Sul), 8T% =—Sp, 
ôT = 38p — ôp + Vn. 


With these decompositions, and again using Eqs. (5.1.20), the Einstein 
field equations (5.1.24)-(5.1.26) and conservation equations (5.1.29) and 
(5.1.30) fall into three classes of coupled equations: 
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Scalar (compressional) modes 
These are the most complicated, involving the eight scalars £, F, A, B, dp, 
dp, x, and du. The part of Eq. (5.1.24) proportional to dik gives 
2 2_S Loy 2 os l2 | 4; 
_4n Ga [50 — 8p - v x | = 50a + Qa? + aä)E + 5V°A -5P À 
ree | ; 
—3a4À — 544V? b +4V°F. (5.1.44 
The part of Eq. (5.1.24) of the form 0;0;,.S (where S is any scalar) gives 
83x | 16x Ga aS +E+A—a2B—3aaB + 2aF + àF | =0. (5.1.45) 


The part of Eq. (5.1.25) of the form 0;S (where S is again any scalar) 
gives 
81 Ga(p + p)dj5u = —ad;E + adjA . (5.1.46) 
Eq. (5.1.26) gives 
—4nG (5p + 36p + Vx) sa ig ee Seep a Be 
2a? 2a a a? 
3. 3a, 3a 1 Zs . 
+od4 A= pe VR 4 v's. 
2 a a 2 a 
(5.1.47) 


The part of the momentum conservation condition (5.1.29) that is a 
derivative 0; is 


Be 34k PEREIS 
aj] dp + Vx + WP +u + EG u+ 56 + PE] =, 
(5.1.48) 
and the energy-conservation condition (5.1.30) is 
fe Ea tn ws ale E a 
5p + — (5p + dp) + y? |- p+ p)F +a? (5 + p)du + tas] 
lesinn 2 
+50 + p)io[34+V B| iG, (5.1.49) 
In Eqs. (5.1.48) and (5.1.49), 5p, 5p, and z$ are elements of the perturbation 
to the total energy-momentum tensor, but the same equations apply to each 


constituent of the universe that does not exchange energy and momentum 
with other constituents. 
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Vector (vortical) modes 


These involve the four divergenceless vectors G;, Ci, bul, and ny. The part 
of Eq. (5.1.24) of the form 0; V; (where V; is any vector satisfying 3; V; = 0) 
gives 


| 16r Gara!’ — aÈ; — 3aaj + aG; + 24G; =0, (5.1.50) 


while the part of Eq. (5.1.25) of the form V; (where V; is again any vector 
satisfying 0;V; = 0) gives 


l l 
81 G( + Pau; = 5V°G — SVG ; (5.1.51) 


The part of the momentum conservation equation (5.1.29) that takes the 
form of a divergenceless vector is 


: 3a 
Vin + dol (6 +u] + —( + pyduy =0, (5.1.52) 


In particular, for a perfect fluid a = 0, and Eq. (5.1.52) tells us that (Ø + 
p)ouy decays as 1/a?. In this case, both Eqs. (5.1.50) and (5.1.51) imply that 


the quantity G; — aC; (which we will see in Section 5.3 is the only physically 
relevant combination of metric components for vector perturbations) decays 
as 1/a?. Because they decay, vector modes have not played a large role in 
cosmology. 


Tensor (radiative) modes 


These involve only the two traceless divergenceless symmetric tensors Dj 
and mi . There is only one field equation here: the part of Eq. (5.1.24) 
of the form of a divergenceless traceless tensor is the wave equation for 
gravitational radiation 


-16r Garf = V?°Dij — a Dy — 3aaDj . (5.1.53) 


The above equations for scalar, vector, and tensor perturbations do not 
form a complete set. This is in part because we still have the freedom to 
make changes in the coordinate system, of the same order as the physical 
perturbations. In Section 5.3 we will see how to remove this freedom by a 
choice of “gauge.” 

But even after the gauge has been fixed, the equations for the scalar 
modes will still not form a complete set, unless the pressure p and anisotropic 
inertia 2° can be expressed as functions of the energy density p. This is the 
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case, for instance, for a constituent of the universe such as cold dark matter, 
whose particles have negligible velocities, and do not interact with other 
constituents. For such constituents, we can simply set p and 2° equal to 
zero. Things are only a little more complicated for constituents of the uni- 
verse whose particles’ velocities cannot be neglected, but which experience 
collisions that are sufficiently rapid to maintain local thermal equilibrium. 
In such cases, 7° can be neglected, and if the particles are highly relativistic 
p is simply p/3. Even where the particles are only moderately relativistic, 
the pressure in thermal equilibrium can usually be expressed as a function 
of and one or more number densities n that satisfy the condition that the 
current nu“ is conserved 


(nu")., =0. (5.1.54) 


This condition tells us that the unperturbed number density satisfies n œ 
a~>, while the perturbation satisfies 


2 (=) + Dwi + 5(34 + vB) -lyro (5.1.55) 
at \ an a? 2 a 

With p given as a function of p and n, after gauge fixing the field equations 
and conservation equations (5.1.48), (5.1.49), and (5.1.55) form a complete 
set of equations for the scalar modes. 

Similarly, even after gauge fixing, the equations for vector and tensor 
modes do not form a complete set unless we have formulas for 7’ and Ty ; 
respectively. This is no problem for perfect fluids, for which 7’ = x7 = 0. 

In the general case local thermal equilibrium is not maintained, and we 
must calculate 3p, dp, x°, 2) and x7’ by following changes in the distribu- 
tion of individual particle positions and momenta, which are governed by 
Boltzmann equations. The Boltzmann equations for photons and neutrinos 
are derived in Appendix H, and used in Chapters 6 and 7. 


5.2 Fourier decomposition and stochastic initial 
conditions 


In order to simplify our work we want to make full use of the symmetries 
of the problem. We have already used the rotational symmetry of the field 
and conservation equations in sorting out perturbations into scalar, vector, 
and tensor modes, and we will apply rotational symmetry again later in this 
section. The equations also have a symmetry under translations in space, 
which can best be exploited by working with the Fourier components of 
the perturbations. As long as we treat perturbations as infinitesimal, there 
is no coupling between the Fourier components of different wave number. 
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Further, only the initial conditions but not the equations themselves depend 
on the direction of the co-moving wave number q. 

Let us first see how this works out for the scalar modes. Because the 
unperturbed metric, energy density, etc. are independent of position, the 
general solution for the corresponding perturbed quantities may be written 
as a superposition of plane wave solutions, whose spatial variation is given 
by factors exp(iq - x), where q is a real wave vector, over which we must 
integrate. The coefficient of exp(iq - x) in each of the scalar perturbations 
A(x, t), B(x, t), etc. is a solution of coupled ordinary differential equations 
with time as the independent variable, obtained by replacing 0/>/ with ig; 
everywhere in the equations presented in Section 5.1. The differential equa- 
tions obtained in this way depend on q = |q|, but not on the direction of q, so 
the solutions can be written as superpositions of independent g-dependent 
normal modes, each characterized by a set of perturbations Ang(t), Bng(t), 
etc., with an overall normalization factor a,(q) carrying a discrete index n 
that labels the various independent solutions. These normalization factors 
depend on the initial conditions, which of course are not rotationally invari- 
ant — if they were then there would be no galaxies or stars — so the a,,(q) 
depend on the direction of q, but the solutions Ajng(t), Bng(t) can be chosen 
to depend only on q = |q|. That is, we write 


A(x, t) = )> J Ag anA He, 
B(x, t) = > J d’q AnA) Bng (He™*, (5.2.1) 


and likewise (with the same œn (q)) for E and F as well as for ôp, dp, ŝu, z5 
and any other rotational scalars. 

Now, as discussed in Section 2.6, we expect the scalar variables A(x, t), 
B(x, t), etc. to be stochastic variables, characterized by averages of their 
products. The solutions {4ng(t), Brg (t),...} are ordinary fixed functions, 
not stochastic variables, so the stochastic nature of the scalar variables arises 
from the stochastic character of the initial conditions, embodied in the fac- 
tors a,(q). Under the assumption that the scalar variables are governed 
by Gaussian distributions, of the sort discussed in Appendix E, all aver- 
ages of scalar quantities can be expressed in terms of bilinear averages 
(A(x, t) A(y, t)), (A(x, £) Bly, £), etc. Let us consider the average of the 
product of any two real scalar quantities X (x, t) and Y(y,f). It turns out 
to be very convenient to use Eq. (5.2.1) for X and its complex conjugate for 
Y, so that 


(X(%,) Yy.)) = >> | d°q J Pd Xng(t) Yig O lenak) X eY, 
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We assume that although the initial conditions are not translationally 
invariant, they are governed by a translationally invariant probability dis- 
tribution function, so that (X(x, t) Y (y, t)) should be a function only of 
x —y. This immediately tells us that (a, (q)æž, (q^) must be proportional to 
a delta function, 6°(q — q^). Furthermore, although the initial conditions 
are not rotationally invariant, we also assume that they are governed by 
a rotationally invariant probability distribution, so that the function of q 
multiplying this delta function can only depend on the magnitude q = |q| 
of the wave vector, not on its direction. That is, we can write 


(an(q)a%,(q’)) = Pam Qeq—4q), (5.2.2) 


and so 
(XK) Vy,)) = 0 J d’q Xng(t) Ying(t) Pam(Q) expliq - (x — y) . 


(5.2.3) 


The task of the theory of cosmological perturbations is twofold: to find 
the solutions Ang(t), Bng(t), etc. of the differential equations under suitable 
assumptions about the constituents of the universe, and to calculate the 
spectral functions Pym(q) in a theory of the origin of fluctuations in the 
very early universe. 

Inspection of Eq. (5.2.2) shows immediately that Pmn(q) is a Hermitian 
matrix 


Fim (4) = Pm (Q) , (5.2.4) 
and it is positive, in the sense that 
XO Pam (QD Erk, > 0 (5.2.5) 


nm 


for any set of complex numbers é, (or functions of q) that are not all zero. 
In general, for an arbitrary choice of independent solutions, there is no 
reason why Py,(q) should also be diagonal, so in general there will be inter- 
ference between the different modes. However, it is sometimes convenient 
to choose the solutions so that Pam(q) is simply equal to nm. To see that 
this can always be done, we recall a theorem of matrix algebra, which says 
that, because it is positive and Hermitian, Pam(q) can be put in the form 


Pr (Q) = > Zar DZinr(@ » (5.2.6) 


for some square matrix Z,,(qg), with r running over as many values as 
n. (That is, in matrix notation, P = ZZ‘.) We can then redefine the 
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independent solutions, defining 


Ayg(t) = XO Zur (qQ)Ang (0) > Brg (0) = XO Znr(Q)Bng) , etc., 


n 


with a corresponding redefinition of the normalization factors 
on(g) = È Znr(Q) &(Q) 
r 
chosen so that 
Aw.) = y, J Pq ări , 
r 


and likewise (with the same &,(q)) for B, E, and F, as well as for dp, dp, du, 
etc. The advantage of this is that now the relevant bilinear averages are 


(č (q)a* (q’)) = 5;s5°(q — q’) . (5.2.7) 


This is the result that was to be proved. With the solutions Arg (t),etc. chosen 
so that the a@,(q) satisfy Eq. (5.2.7), the different modes are uncorrelated. 
That is, any binary average is a sum over the individual modes. For instance, 


(A(x, 1) A(y,)) = Do J Pq ATD AO. (5.2.8) 


In this way, the problem of calculating the spectral function Pam(q) is traded 
for the problem of finding the correct linear combination of solutions for 
which Eq. (5.2.7) applies. 

Incidentally, there is no problem in calculating the averages (aa) if 
we know the averages (a,a@;,), because the a, satisfy a reality condition. 
To derive this condition, note that the differential equations for Anq(t), 
Bng(t), etc. are real, so the set of complex conjugates Ang(t), Brg), etc. 
of solution n is also a solution, and can therefore be expressed as a set of 
linear combinations `, Cnm(q)Amg(), Yom Cnm(G) Bing (t), etc. (Of course, 
in the case of a real solution, Cym(g) = ônm.) The functions A(x, t), B(x, t) 
are real, so by taking the complex conjugate of Eqs. (5.2.1) and replacing 
the integration variable q with —q we see that 


NO aO cima) = om (—@) - (5.2.9) 


In particular, it follows that 


(on (q)am(q’)) = X Prid) cm) (a+ q) . 
l 
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Now let us consider the tensor modes. These are completely charact- 
erized by the traceless divergenceless symmetric tensor Dj(x, t). (The stress 
tensor my (x, t) is not an independent dynamical quantity but, as we will 


see in Section 6.6, it is given by a linear functional of Djj—specifically, the 
solution of a linear integral equation with Dj as the inhomogeneous term.) 
If we write Dj as a Fourier integral 


Dj, t) = J dq e't* Di(q, t) , (5.2.10) 


then the Fourier transform D must satisfy the conditions 
Dy=Di, Di=0, qDj=0. (5.2.11) 


For a given wave vector q, there are just two independent matrices satisfying 
these conditions. For instance, for q in the three-direction, Eq. (5.2.11) 
requires that 


Diu =-D2, Dy2»=Py, Dg =Dz=0, (5.2.12) 


so all Dj can be expressed in terms of the two independent components D11 
and D2. (These components are frequently denoted ht = D11 and h* = 
D2.) It is convenient to classify the possible Dy by their transformation 
properties under a rotation by an angle 6 around the three-axis.! It is easily 
seen that under such a rotation, 


Di1 > cos? 0 D1; +cos8 sind Di2 + sin@ cos6 Dy, + sin? 6 Dr 
= cos26D,; + sin 20 Di2 

Di2 > cos? 0 Di2 — cos 0 sin6D + sin@ cos 8 Dx — sin? 6 D>; 
= — sin 20 Dıı + cos 20 D12, 


or more succinctly 
Dii FiDi > gue [Pu F iD12| : (5.2.13) 


For this reason, the linear combinations D11 + iD} are said to have helicity 
+2. (A wave of helicity à consists of quanta with angular momentum in 
the direction of motion equal to fd.) We will write Dj(q, £) as a sum over 
helicities: 


Dja D= $ eg(G.2)D@,0) , (5.2.14) 
A=+2 


I See, for instance, G&C, Sec. 10.2. 


232 


5.2 Fourier decomposition and stochastic initial conditions 


where for q in the three-direction 
1 
JT? 
eiz = e3; = 0, (5.2.15) 


e11 Ê, £2) = —e2(Z, 2) = Fie12(Z, £2) = Fier (2,42) = 


while for ĝ in any other direction e;(g, +2) is defined by applying on each 
of the indices į and j a standard rotation, that takes the three-direction into 
the direction of ĝ. 

For q in the three-direction, the combination D1, (q, t) F iD12(q, £) is 
proportional to D(q, +2, t), so according to the transformation rule (5.2.13), 
a rotation by an angle 6 around the direction of q changes D(q, +2, t) by a 
factor exp(+2i0). There is nothing special about the three-direction, and 
the same is true for q in any direction. 

The quantities D(q,4,f) in Eq. (5.2.14) satisfy the A-independent 
second-order field equation (5.1.53), with the Laplacian V? replaced with 
—q? = —q: 
qg 


DaD + 3D, ) + ED@q, 4,1) = 16rGx7(q,2,0), (5.2.16) 
a 


where x 7 (q, A, t) is the Fourier transform of the tensor part of the anisotropic 
inertia tensor: 


aiad [Pace Gox7 6217 
A=t2 


As already mentioned, z/ (q, A, t) isa helicity-independent linear functional 
of Dq, A, t). Therefore, just as in the absence of anisotropic inertia, there 
are two independent solutions (distinguished by a label N) that, aside from 
normalization factors 6, are independent of à and of the direction of q. 
Thus we can write 


Dq.r,t) = D> Bn. XD) , (5.2.18) 
N 


and 


m7 (q,A,t) = >> Bn a Nai) , (5.2.19) 
N 


the sum over N running over the labels of the two independent solutions 
Dyzg(t) of the field equation 


: 2 
Byg(t) + 3—Dyg(d) m L Dul) =16rGrf (t). (5.2.20) 
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Putting together Eqs. (5.2.10), (5.2.14), and (5.2.18), we have 


Dy, = >> J d’q e!4* ep, A) BNA, Dna. (5.2.21) 


The normalization factors By (q, à) like the æn (q) are stochastic variables. 
The translational invariance of the probability distribution governing these 
factors tells us that (By (q, à) By, (q', A’)) is proportional to 6°(q—q’). Under 
a rotation by an angle 0 around the direction of q, the product By (q, à) 


Bx, (q, 4’) is changed by a factor exp (i0 (A—A’) ), so the rotational invariance 


of the probability distribution requires further that (By (q, à) Bx, (q’,A’)) 
is also proportional to ôx. For the most part, we will also assume that 
the probability distribution is invariant under the space-inversion opera- 
tor P. This operator reverses the direction of momentum but not of angular 
momentum, so it changes the sign of the helicity, and therefore with this 
assumption, aside from the factor ô}, the mean value (By (q, A) Bx, (q', 4’) 
is independent of helicity.? We have then 


(By (a, A) Bx (al, A) = PECA) Say 0°(q — q’) (5.2.22) 


Rotationaland translational invariance also tell us that there is no correlation 
between the normalization factors for scalar and tensor modes: 


(By (q, A) a (q')) = 0. (5.2.23) 


Just as in the case of scalar modes, it is always possible to choose the two 
tensor modes so that P¥w (q) is just vv’, but we need to know the probabil- 
ity distribution governing initial conditions in order to decide which linear 
combination of modes have this property. 

The average of a product of two tensor perturbations is given by 
Eqs. (5.2.21) and (5.2.22) as 


OE NDGA = Be dq PEN (g) lO 
NN’ 


x Ti kel Png ODA g(t), (5.2.24) 


2The same conclusion follows if we assume that the probability distribution is invariant under CP, 
where C is the charge conjugation operator. Neither P nor CP is an exact symmetry of nature, but 
CP-invariance is a better approximation than P-invariance. The product CPT (where T is the reversal 
of the direction of time) is an exact symmetry in any quantum field theory, but T-invariance is broken 
by the expansion of the universe. 
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where? 
Mykel) = X ey, A) eke, A) 
Xr 


= Six dje + Siedjx — SiSke + Sijke + Skeĝiĝj 
— Si Gide — Sie dk — Sik dide — Sjeĝiĝk + iĝjåkåe. (5.2.25) 
Formulas like Eqs. (5.2.24) and (5.2.3) will be applied to the cosmic 
microwave background in Section 7.4. 

We will see in Section 5.4 that the anisotropic inertia zy, becomes neg- 
ligible during the era when the physical wave number q/a is much less than 
the expansion rate a/a. In this case, one tensor mode becomes dominant. In 
the absence of anisotropic inertia, the gravitational wave equation (5.2.20) 
in the limit g/a K H becomes 


Dyq(t) T a =0. 


This has two obvious solutions: 
Cc dt’ 
a(t’) ` 


Dig(t)=1, Dal) = J 
t 


(The integral in the second solution converges, because a(t) œ t!/? in the 
radiation-dominated era, and grows even faster in the matter-dominated 
and vacuum-dominated eras.) Since D2,(t) — 0 at late times, for generic 
initial conditions the gravitational waves will eventually be dominated by the 
first solution. Thus in this case, to evaluate bilinear averages like (5.2.24), 
we only need to know P7,"" (q). Alternatively, we could take Dj,(1) to be a 
constant Dj (the o superscript denoting “outside the horizon”), chosen so 


that PA (q) = 1. In Section 5.4 we will see that similar remarks usually 
apply to the scalar modes. 


5.3 Choosing a gauge 


The equations derived in Section 5.1 have two unsatisfactory features. First, 
even with the simplifications introduced by decomposing the equations into 


3To obtain the final formula for My ke(ĝ), one can use the conditions that Ij ke (ĝ) is a tensor 
function of ĝ, symmetric in i and j and in k and £, that Ny pe (ĝ) = TZ, j® and that gM ke ĝ) = 0, 
to show that Tj ke (q) is proportional to the quantity on the right-hand side of Eq. (5.2.25). The constant 
of proportionality can then be found by using Eq. (5.2.15), letting ĝ be in the three-direction and setting 
i,j,k, and £ all equal to 1. 
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scalar, vector, and tensor modes, the equations for the scalar modes are 
still fearsomely complicated. Second, among the solutions of these equa- 
tions are unphysical scalar and vector modes, corresponding to a mere 
coordinate transformation of the unperturbed Robertson—Walker metric 
and energy-momentum tensor. We can eliminate the second problem and 
ameliorate the first by fixing the coordinate system, adopting suitable con- 
ditions on the full perturbed metric and/or energy-momentum tensor. We 
will deal here with the coordinate-dependent perturbations A(x, t), B(x, t), 
etc., but all of the results of this section could be applied just as well to the 
Fourier components Ang (t), Bng(t), etc., by simply replacing each Laplacian 
with —q?. 
Consider a spacetime coordinate transformation 


x > x" = x 4 M(x), (5.3.1) 


with e“(x) small in the same sense that h,,, 50, and other perturbations 
are small. Under this transformation, the metric tensor will be trans- 
formed to 


Ox Ox 
x/l ax!Y à 


Ein) = 8u (x E (5.3.2) 
Instead of working with such transformations, which affect the coordi- 
nates and unperturbed fields as well as the perturbations to the fields, it 
is more convenient to work with so-called gauge transformations, which 
affect only the field perturbations. For this purpose, after making the coor- 
dinate transformation (5.3.1), we relabel coordinates by dropping the prime 
on the coordinate argument, and we attribute the whole change in guy (x) 
to a change in the perturbation h,,,(x). The field equations should thus 
be invariant under the gauge transformation h(x) > huv (x) + Ahu (x), 
where 


Ahyy(x) = Zin — gm (xX), (5.3.3) 


with the unperturbed Robertson—Walker metric 8, (x) left unchanged, and 
corresponding gauge transformations of other perturbations. To first order 
in €(x) and huv (x), Eq. (5.3.3) is 


Ahyy (x) = Syy(X') — 


OS uv (Xx x) e a 
Oe: (x) — Suv (x) 


ð ðe’ dZ 
Bau) E D -iuw v. Sir wa 


(x), 
(5.3.4) 
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or in more detail 


dci eG 
dE; d0€9 a 
Ahio = — — — + 2-6, 5.3.6 
ae ot ox! T Pi ( ) 
0€0 
Ahoy = —2—, 5.3.7 
00 F ( ) 


with all quantities evaluated at the same spacetime coordinate point, and 
indices now raised and lowered with the Robertson—Walker metric, so that 
co = —e? and e; = ae’. The field equations will be invariant only if 
the same gauge ifansformation is applied to all tensors, and in particular 
to the energy-momentum tensor, so that we must transform ôT (x) > 
bTyy(x) + AdT v(x), where A ôT, is given by a formula! analogous to 


Eq. (5.3.4): 


= ao ITa x) 
A8Tyy = -Bu — Thy) a e) , (5.3.8) 
or in more detail 

AsTy=—p (= a4 =<) if 7 (a P) Bre (5.3.9) 
de; _de0 

A6T yy = —P ~ +p 7 +285 in (5.3.10) 
d€0 

A ô Too = 2p" + peo. (5.3.11) 


Note that we use 6 to signify a perturbation, while A here denotes the change 
in a perturbation associated with a gauge transformation. 

In order to write these gauge transformations in terms of the scalar, 
vector, and tensor components introduced in Section 5.1, it is necessary to 
decompose the spatial part of e” into the gradient of a spatial scalar plus a 
divergenceless vector: 


6 = de +e), def =0. (5.3.12) 


Then the transformations (5.3.5)-(5.3.7) and (5.3.9)-(5.3.11) give the gauge 
transformations of the metric perturbation components defined by 


'The right-hand sides of Eqs. (5.3.4) and (5.3.8) are known as the Lie derivatives of the metric and 
energy-momentum tensor, respectively. For a discussion of Lie derivatives, see G&C, Secs. 10.9 and 
12.3. 
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Eqs. (5.1.31)-(5.1.33): 


2 2 
ie maser AB=-e*, 
a 
1 
AC; = -=el , ADj = 0, AE =2ċė0, (5.3.13) 
j i 


1 24 1 24 
AF = — (-« ave? Ae Aes) , AG=- (e + ag) ; 
a a a a 
and of the perturbations (5.1.39)-(5.1.41) to the pressure, energy density, 
and velocity potential 
Aép = peo , Adp= peo , Adu=—eEq. (5.3.14) 


The other ingredients of the energy-momentum tensor are gauge invariant: 


Ag” = he! = Ang = Adul = 0: (5.3.15) 
Se. nV = Hf 
fluid and the condition buy = 0 for potential (Ze., irrotational) flow are 
gauge invariant. 

For the field equations to be gauge-invariant, similar transformations 
must of course be made on any other ingredients in these equations. For 
instance, any four-scalar s(x) for which s’(x’) = s(x) under arbitrary four- 
dimensional coordinate transformations would undergo the change 
A ôs(x) = s(x) — s(x) = s'(x) — 5'(x’), which to first order in pertur- 
bations is 


Note in particular that the conditions z = 0 for a perfect 


Os(t) 


J etx) = 5e . (5.3.16) 


A s(x) = s(x) — s(x) = — 


This applies for instance to the number density n or a scalar field g. For a per- 
fect fluid both p and p are defined as scalars, and the gauge transformations 
in Eq. (5.3.14) of êp and dp are other special cases of Eq. (5.3.16). Like- 
wise, for a perfect fluid the gauge transformation in Eq. (5.3.14) of ôu can 
be derived from the vector transformation law of u,. Because the gauge 
transformation properties of 5p, dp, ŝu, etc. do not depend on the conser- 
vation laws, Eqs. (5.3.14) and (5.3.15) apply to each individual constituent 
of the universe in any case in which the energy-momentum tensor is a sum 
of terms for different constituents of the universe, even if these individual 
terms are not separately conserved. 

We can eliminate the gauge degrees of freedom either by working only 
with gauge-invariant quantities, or by choosing a gauge. The tensor 


2J, M. Bardeen, Phys. Rev. D 22, 1882 (1980). 
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quantities mp and Dj appearing in Eq. (5.1.53) are already gauge invariant, 
and no gauge-fixing is necessary or possible. For the vector quantities x, 
bul’, C; and G;, we can write Eqs. (5.1.50)-(5.1.52) in terms of the gauge- 
invariant quantities xy, buy and G; = G; — aC;, or we can fix a gauge for 
these quantities by choosing ey so that either C; or G; vanishes. (Note that 
G; is the vector field that we showed in Section 5.1 to decay as 1/a? in the 
absence of anisotropic inertia.) For the scalar perturbations it is somewhat 
more convenient to fix a gauge. There are several frequently considered 
possibilities. 


A. Newtonian gauge 


Here we choose eù so that B = 0, and then choose €o so that F = 0. Both 
choices are unique, so that after choosing Newtonian gauge, there is no 
remaining freedom to make gauge transformations. It is conventional to 
write E and A in this gauge as 


E=20, A=-20, (5.3.17) 


so that (now considering only scalar perturbations) the perturbed metric 
has components 


go =-1-20, go =0, gy=a°dj[1—2¥]. (5.3.18) 
The gravitational field equations (5.1.44)-(5.1.47) then take the form 


—4r Ga? [80 — dp — Vx] = aa® + (4a7 + 2aa)® — Vw + aw 


+6aa , (5.3.19) 
—81 Ga’ ðið = 99 — Y], (5.3.20) 
An Ga(p + p)djsu = —49;0 — ad; , (5.3.21) 
1 3a . 
4nG (5p 4+ 38p + Vas) =+5V°6+ “6 
a a 
x 6a . 64 
Üt ito, (5.3.22) 


and the equations (5.1.48) and (5.1.49) of momentum and energy conser- 
vation become (aside from modes of zero wave number) 


3a 
dp+V7n* + dol(6 + psu] + (P+ Piu+ @+P)P=0, (5.3.23) 


36 3 . 
TEETER [ere Ipu a5] -3(6 +p) =0. (5.3.24) 
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In particular, Eq. (5.3.20) shows that ® and Y are not physically independent 
fields; they differ only by a term arising from the anisotropic part of the stress 
tensor, and in particular they are equal for a perfect fluid, for which x5 = 0. 
The perturbation to the number density of a species of particle whose total 
number is conserved will satisfy relation (5.1.55), which in Newtonian gauge 
reads 


2 (=) + Lisy -3ý =0. (5.3.25) 
at \ A a? 

Given an equation of state for p as a function of p (or, if p depends also on 
other quantities like n, then given also field equations for those quantities) 
and given also a formula for x° as a linear combination of the other pertur- 
bations (such as for instance the formula 2° = 0 for a perfect fluid) we can 
regard Eqs. (5.3.21), (5.3.23), and (5.3.24) (and, where needed, Eq. (5.3.25)) 
as equations of motion for Y, ŝu, and dp, respectively, with ® given in terms 
of Y by Eq. (5.3.20). The remaining equations provide a constraint on the 
solution of this coupled system of equations. By subtracting 3/a? times 
Eq. (5.3.19) from Eq. (5.3.22) and then using Eqs. (5.3.20) and (5.3.21) to 
eliminate 1° and ©, we find that? 

a 


asp — 3Ha (5 + pou — (z 


) Vy =0. (5.3.26) 


This is aconstraint rather than an equation of motion, because the equations 
of motion (5.3.21), (5.3.23), and (5.3.24) imply that the left-hand side of 
Eq. (5.3.26) is time-independent, so that Eq. (5.3.26) only has to be imposed 
as an initial condition. 


B. Synchronous gauge 
Here we choose €9 so that E = 0, and then choose eù so that again F = 0. 


Considering only scalar perturbations, the complete perturbed metric is 
then 


a°B 


dx'dx/ 
In this gauge, the Einstein field equations (5.1.44)-(5.1.47) take the form 
2 2_S lo, los sao b sogi 
—4r Ga [50—sp—V T | = aN A-sa A—3aaA—~aav B, (5.3.28) 


—16r Gan’ = A—a?B — 3aåb , (5.3.29) 
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81 Ga(p+p)su=aA , (5.3.30) 
36)... 3a-% E E / 3 
—4rG (5p +38p + vx) = 24+ Zit vB + EVR, (5.3.31) 
2 a 2 a 
and the equations (5.1.48) and (5.1.49) of momentum and energy 
conservation read 


m 
DEN nE + aol (6 + B)Su] + <e +p)su=0, (5.3.32) 
if . 
Sb + Lp + 5p) + V? |o + pou + tas] 
are 2 
+50 + p)d]34 +V B] iy, (5.3.33) 


Note that in this gauge the equation of momentum conservation, which 
furnishes the equation of motion (the Navier-Stokes equation) for an 
imperfect fluid, does not depend at all on the perturbed metric, while the 
equation of energy conservation may be written as 


34 A 
ôb + = (5p + 6p) + V? [>o + p)du + ias] 
+(pt+p)y =0 (5.3.34) 


where 


| aay . Ə (hü 

= ~34+V*B] = —(—). 5.3.35 

v= BA+VŻI a (55 (5.3.35) 

We need A and B separately to calculate the motion of individual parti- 

cles, but the effect of gravitation on a perfect or imperfect fluid is entirely 

governed by the quantity y. Inspection of the field equation (5.3.31) shows 

that it provides a differential equation for just this combination of scalar 
fields: 


—4r Ga? (5p + 3ôp + vêz") = (ey) (5.3.36) 


Also, in synchronous gauge the equation (5.1.55) for particle conservation 
takes the form 
ð (6 
- (=) +aV26u+y =0. (5.3.37) 
ot \n 
Given an equation of state for p as a function of p (and perhaps n) and 
a formula expressing 2° as a linear combination of the other scalar per- 
turbations we can use Eqs. (5.3.32), (5.3.34), (5.3.36) (and perhaps 
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Eq. (5.3.37)) to find solutions for the three independent perturbations ŝu, 
6p, and y, respectively. The left-over equations (5.3.28)-(5.3.30) are not 
needed, for a reason given in Section 5.1: the full set of equations (5.3.28)— 
(5.3.33) are not independent, because the equations of energy and momen- 
tum conservation can be derived from the Einstein field equations. 

If we need to know A and B separately we can find them from y and ép. 
By adding 3 times Eq. (5.3.28), plus 1/2 the Laplacian of Eq. (5.3.29), plus 
a times Eq. (5.3.31), we obtain the simple relation 


V?A = -8r Gasp + 2Ha y , (5.3.38) 


where as usual H = å/a. After A is found in this way, we can find B from 
A and w by solving Eq. (5.3.35). 

Synchronous gauge was widely used in early calculations of the evolution 
of perturbations in cosmology, starting with the ground-breaking work 
of Lifshitz in 1946.3 In the 1980s synchronous gauge became unpopular, 
because of a feature emphasized by Bardeen:? even after we impose the 
conditions E = F = 0, we are left with a residual gauge invariance. We can 
see from Eq. (5.3.13) that E and F are not affected by a gauge transforma- 
tion with 


Oox, D =—t(x), Ex N= POT) | a*(t)dt, (5.3.39) 
where t(x) is an arbitrary function of x, but not of t. But under this trans- 
formation A and B do change 

z: 
Ma ABA J a? (À dt . (5.3.40) 


a 


In particular, the combination (5.3.35) undergoes the change 
d (i 
EEE (<) — avr. (5.3.41) 
a 
Also, the changes in the perturbations to the energy density, pressure, and 
velocity potential are given by Eqs. (5.3.14) and (5.3.39) as 
Aôp = —pt , Adp=—pt, Adu=t, (5.3.42) 


while x° is invariant. The same transformation rules apply for any one 
of the individual constituents of the universe. Any scalar perturbation ôs 
such as the number density perturbation ôn or a scalar field perturbation 


3E, Lifshitz, J. Phys. USSR 10, 116 (1946). Also see G&C, Sec. 15.10. 
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ôg undergoes a change like that of the pressure and density perturbations: 
As = —St . (5.3.43) 


The reader can check that all of the equations (5.3.28)-(5.3.34) and (5.3.37) 
are invariant under these residual gauge transformations. This being the 
case, for any solution y, dp, 5p, du, dn, etc. of these equations there will 
be another solution w + Ay, êp + Adp, ôp + Adp, du + Adu, n + Aédn, 
etc., and since the field equations are linear, this means that Aw, Adp, Aédp, 
Aéu, Aén, etc., is also a solution. (For this solution there is no scalar 
anisotropic inertia, because x% is gauge invariant.) This is a nuisance, 
because in finding solutions of the field equations we keep having to check 
that our solution represents a physical disturbance, not a mere change of 
gauge. 

However, this is not a problem if we can remove the residual gauge sym- 
metry in any natural way. This is the case if the universe contains a fluid (such 
as cold dark matter) whose individual particles are moving at speeds much 
less than that of light. In this case, the space-space components T 5 of the 
energy-momentum tensor for that fluid are negligible, so we can take pp = 0, 
pp x a7?, and ôpp = TS = 0 in the equation (5.3.32) of momentum con- 
servation for this fluid, which tells us then that dup is time-independent. 
According to Eq. (5.3.42) (which applies separately to each constituent 
of the universe) a time-independent velocity potential dup can always be 
removed in synchronous gauge by a residual gauge transformation, with t = 
—dup. There is then no longer any ambiguity in the choice of gauge. These 
features make synchronous gauge particularly convenient in dealing with 
the later stages of cosmological evolution, when cold dark matter plays a 
prominent role. 


C. Newtonian/synchronous conversion 


We will find it convenient to do calculations using Newtonian gauge in some 
eras, and synchronous gauge in others. To connect results for different eras, 
we need to be able to convert them from one gauge to another.4 

Suppose first that we begin in Newtonian gauge, and make an 
infinitesimal coordinate transformation x” > x” + e”, with e; = ðe! 
so as not to induce vector perturbations. According to Eqs. (5.3.13) and 
(5.3.17), in order to give E the synchronous gauge value E = 0, we need 


čo = —® (5.3.44) 


4See, for instance, C.-P. Ma and E. Bertschinger, Astrophys. J. 455, 7 (1995). 
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Then according to Eq. (5.3.13), to keep F = 0 we need 
a («> €0 
ies ee 5.3.45 
at (5) a? ( ) 
The A and B components of the spatial metric in synchronous gauge are 


given by Eqs. (5.3.13) and (5.3.17): 


2 
A=-2W+2He, B=-56", (5.3.46) 
a 
where H = a/a. In particular, the field y is 


ə v2 
fo S 3—(H €o) 4: x (5.3.47) 


Also, Eq. (5.3.14) allows us to calculate the synchronous gauge pressure 
perturbation dp*, energy density perturbation 6p*, and velocity potential 
du’ from the corresponding quantities 5p, 60, and ôu in Newtonian gauge: 


pS =8p+ eop, dp’ =p +eco, Su = őu— ep. (5.3.48) 


Given ® we can calculate €o from Eq. (5.3.44), and then given Y we can 
obtain w from Eq. (5.3.47) and the synchronous gauge pressure, energy 
density, and velocity potential perturbations from Eq. (5.3.48). The quan- 
tity €o is determined by Eq. (5.3.44) only up to a time-independent function 
of position, so the values of the synchronous gauge quantities A, B, w, 
Pp, Ò, and du are only determined up to a residual gauge transformation 
(5.3.40)-(5.3.42). 

Next suppose that we begin in synchronous gauge, with metric fields A 
and B, and want to convert to Newtonian gauge. According to Eq. (5.3.13), 
to make gj; proportional to 6; we need to take 


=a B/2. (5.3.49) 
Then to keep gio = 0, Eq. (5.3.13) tells us that we must take 
co = —a°B/2. (5.3.50) 


Using Eq. (5.3.13) again together with the definitions (5.3.18), we have 
then 


l EPET 

a sz (4 b) , (5.3.51) 
l 2a 1 aa. 

We" (As Sean Re 3.52 
;( + te) 5 + 7 (5.3.52) 


5.4 Conservation outside the horizon 


In contrast to the previous case, Eqs. (5.3.51) and (5.3.52) give ® and Y 
uniquely. Not only that — it is easy to see that the results for ® and Y are 
unaffected if the A and B with which we start are subjected to the residual 
gauge transformation (5.3.40). 


D. Other gauges 


In choosing a gauge, it is not necessary to impose conditions only on the 
scalar fields appearing in the metric tensor.> Instead, some of the gauge 
conditions can impose constraints on the scalars appearing in the energy- 
momentum tensor. For instance, in co-moving gauge we choose €o so that 
ôu = 0 (which for scalar perturbations makes the velocity perturbation 
du’ vanish). Where the only “matter” is a single scalar field, as in popu- 
lar theories of inflation, this means that the time coordinate is defined so 
that at any given time the scalar field equals its unperturbed value, with 
all perturbations relegated to components of the metric. In the constant 
density gauge we choose €o so that 60 = 0. In either case, after fixing 
co we can make F vanish with a suitable choice of ef, so that the scalar 
perturbations still have g;ọ = 0. Note that although this procedure fixes 
€o, it only fixes «* up to terms of the form a° (r(x), so these gauges 
share the drawback of synchronous gauge, of leaving a residual gauge 
symmetry. 


5.4 Conservation outside the horizon 


The perturbations that concern us are believed to have originated in quan- 
tum fluctuations during an era of inflation in the very early universe, dis- 
cussed in Chapter 10. That early time and the much later time when 
these perturbations are observed are separated by a time interval in which 
the equations governing perturbations are not well known. For instance, 
at the end of the era of inflation there was a time of so-called reheat- 
ing, during which the energy of the vacuum was transferred to ordinary 
matter and radiation, but we have no idea what particles were first cre- 
ated during reheating or how the energy transfer took place. Later, there 
was presumably a time when some particles effectively stopped interacting 
with the rest of matter and radiation, and became what is now observed 
as cold dark matter, but we can only guess when this was and how it 
happened. 


5A great variety of different gauges are described by H. Kodama and M. Sasaki, Prog. Theor. Phys. 
Suppl. 78, 1 (1984). 

6For instance, this gauge was used in calculations of non-Gaussian corrections to cosmological 
correlations by J. Maldacena, J. High Energy Phys. 0305, 013 (2003) [astro-ph/0210603]. 
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The only reason that we are able to use inflationary theories to make any 
predictions at all about observable perturbations despite these uncertainties 
is that the wavelengths of the perturbations that concern us were outside the 
horizon during a period that extends from well before the end of inflation 
until relatively near the present (and hence includes the times of reheating 
and cold dark matter decoupling), in the sense that the physical wave number 
q/a during this period was much less than the expansion rate H = å/a. 
During inflation q/a decreases with time more-or-less exponentially while 
H is roughly constant, so all perturbations originally had g/a >> H but 
except for very short wavelengths eventually have g/a « H. Then during 
the radiation and matter-dominated eras a increases like t!/ or 17/3 while 
H falls like 1/t, so except for the longest wavelengths the perturbations 
that had g/a « H at the end of inflation eventually again have g/a > H. 
During the intervening period when perturbations were outside the horizon, 
the scalar and tensor fluctuations were subject to certain conservation laws, 
that allow us to connect the distant past to the relatively recent past. 

As we shall see below, for scalar modes it is the quantity defined in 
Newtonian gauge by! 


that in certain circumstances is conserved outside the horizon. (Recall 
that ŝu is the velocity potential for the total energy-momentum tensor.) 
Equivalently, there is another quantity? 


gadi i (5.4.2) 
3(p +P) 
that according to the constraint (5.3.26) is related to R, by 
2 
q Y 
ta = Rq 1 (5.4.3) 


— 12rG(5 +p)a ` 


The difference is of relative order (q/aH)?, so for q/a « H the quantity bq 
is conserved outside the horizon if Rq is. 

The importance of R, in the work of ref. 1 is that (as will be shown 
below) it is conserved outside the horizon in inflation driven by a single scalar 


'The constancy of was noted in various special cases by J. M. Bardeen, Phys. Rev. D 22, 1882 
(1980); D. H. Lyth, Phys. Rev. D 31, 1792 (1985). For reviews, see J. Bardeen, in Cosmology and Particle 
Physics, eds. Li-zhi Fang and A. Zee (Gordon & Breach, New York, 1988); A. R. Liddle and D. H. 
Lyth, Cosmological Inflation and Large Scale Structure (Cambridge University Press, Cambridge, UK, 
2000). 

2J, M. Bardeen, P. J. Steinhardt, and M. S. Turner, Phys. Rev. D 28, 679 (1983). This quantity was 
re-introduced by D. Wands, K. A. Malik, D. H. Lyth, and A. R. Liddle, Phys. Rev. D 62, 043527 (2000). 
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field. But this is not enough for our purposes. After inflation, the universe 
becomes filled with a number of types of matter and radiation, and Ry is 
not necessarily conserved outside the horizon under these circumstances. 
What we need is a theorem that says that, whatever the constituents of the 
universe, there is always a solution of the field equations for which Rq and ¢g 
are conserved outside the horizon. Such a solution is called “adiabatic,” for 
reasons explained later. If cosmological fluctuations are described by such 
a solution during inflation, then they will continue to be described by such 
a solution and Rq and ¢, will remain constant as long as the perturbation 
is outside the horizon, because this is a solution under all circumstances. 

This result (and a corresponding result for tensor modes) is contained 
in the following theorem:? Whatever the contents of the universe, there are 
two independent adiabatic physical scalar solutions of the Newtonian gauge 
field equations for which the quantity Rq is time-independent in the limit 
q/a < H, and there is one tensor mode for which the tensor amplitude Dg 
is time-independent in the limit g/a « H. In this limit one of the scalar 
modes has R, Æ 0; the scalar metric components are 


PIO = WA = Ry |- 1 + zo f att) | : (5.4.4) 
the perturbation to any four-scalar s(x) (such as the energy density, pressure, 


inflaton field, etc.) is given by 


Rg S(t) 
a(t) 


and the perturbation to the velocity potential is 


t 
ôs (Ð) = — a(t’) dt’ ; (5.4.5) 


R t 
ug (t) = D l a(t’) dt’ . (5.4.6) 


where 7 is an arbitrary initial time, the same in all integrals. The other 
scalar mode has R4 = 0, and 


C,H (t) Cy S(t) Ca 
DO = WA = 2 —— , s=- , Sult) = —* , (5.4.7) 
a á a(t) i a(t) MNRE aie) 
where Cy is Hg ndppenarnt, For q/a «&« H, the anisotropic inertia 
components nS V andr? vanish in both adiabatic scalar modes and in 


Tig? ijq 
the conserved tensor mode, even when some mean free times are comparable 


38. Weinberg, Phys. Rev. D 67, 123504 (2003) [astro-ph/0302326]. Also see the appendix of S. 
Weinberg, Phys. Rev. D 69, 023503 (2004) [astro-ph/0306304]. 


247 


5 General Theory of Small Fluctuations 


with the Hubble time. These are physical solutions for scalar as well as ten- 
sor modes, because the choice of Newtonian gauge leaves no remaining 
gauge freedom. 

The proof of the theorem is based on the observation that in the special 
case of a spatially homogeneous universe, the coordinate space Newtonian 
gauge field equations and dynamical equations for matter and radiation (as 
well as the condition of spatial homogeneity) are invariant under coordi- 
nate transformations that are not symmetries of the unperturbed metric.* 
In Newtonian gauge, general first-order spatially homogeneous scalar and 
tensor perturbations to the metric take the form 


ho = —20(t), ho =0, hy = —28ja? AVA + PODIO) , 


with Dj; subject to the condition that D; = 0. Spatial homogeneity also 
requires that all pressures, densities, velocity potentials, etc. are functions 
only of time. The Newtonian gauge field equations for these spatially 
homogeneous perturbations are necessarily invariant under those gauge 
transformations of the form (5.3.4)-(5.3.7), (5.3.14)-(5.3.16) that preserve 
the conditions for Newtonian gauge and spatial homogeneity. Eq. (5.3.7) 
shows that in order for hoo to remain spatially homogeneous, €o must be of 
the form 

ox, 1) = €(1) + x(x), 
so that 

A®=€E 

Eq. (5.3.6) then shows that in order for the Ajo to remain equal to zero, €; 
must have the form 


dx(x) f dt 


Ex D= POO- POT) a 


Eq. (5.3.5) then shows that 


dfi Əh , 3? x dt 
tyre ee i J A = = 
os (= i vn) + 2sjaal¢ is x] T Əxi J a 


In order not to introduce any x-dependence in hj, we must take x constant, 
in which case by shifting it into e it can be taken to be zero, and we must 
also take f; to have the form f;(x) = wx’, with wj a constant matrix. (An 
x-independent term in f; would have no effect on the metric or anything 


4Tn this respect, the theorem proved here is similar to the Goldstone theorem of quantum field theory; 
see J. Goldstone, Nuovo Cimento 9, 154 (1961); J. Goldstone, A. Salam, and S. Weinberg, Phys. Rev. 
127, 965 (1962). The modes for which Rg or Dg are constant outside the horizon take the place here of 
the Goldstone bosons that become free particles for long wavelength. 
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else, and so can be ignored here.) Then 
Ah; = -0 [wi + wji] + 2yade . 


We compare this with Eq. (5.1.33) (with the Newtonian gauge conditions 
B = Ci = 0 and A = —2W), which gives 


Ahi = —2a° 6 AY + a ADj 


Matching the terms in Ah; that are either proportional to 6; or traceless 
gives 
1 
AW = zei — He 5 


AD jy = —øij — wji + jon . 

(Note that an antisymmetric term in wj would have no effect, because 
the unperturbed metric is invariant under three-dimensional rotations.) 
The corresponding gauge transformations of the quantities appearing in the 
energy-momentum tensor and of general scalars are given by Eqs. (5.3.14)— 
(5.3.16). Since {huv, Tuv} and {Apyt+Ahyv, Tuv+AT yy} are both solutions 
of the field equations and conservation equations, their difference must also 
be such a solution. We conclude that there is always a spatially homoge- 
neous solution of the Newtonian gauge field and conservation equations, 
with scalar perturbations 


Y = He -a;i/3, P=—-E (5.4.8) 
ôp = —pe , o= — pe , ÔUS=E, > =0 (5.4.9) 
and more generally for any four-scalar s 


és = —Se. (5.4.10) 


(The reader can check that when we drop all spatial gradients, then the 
perturbations (5.4.8)-(5.4.10) satisfy the Newtonian gauge field equations 
(5.3.19)-(5.3.22), the conservation laws (5.3.23)-(5.3.24) and (5.3.25), and 
the constraint (5.3.26).) There is also a spatially homogeneous solution with 
a tensor perturbation 


1 
Dij X wij — 35ijOkk » my =0. (5.4.11) 
(This is obviously a solution of Eq. (5.1.53), but it includes the not so obvious 
information that the equations that determine Ty necessarily give mi =0 
for Dj constant, even if some particle collision rates become comparable 


249 


5 General Theory of Small Fluctuations 


with H.) Equivalently, Eqs. (5.4.8)-(5.4.11) are solutions for the Fourier 
transforms of the perturbations with zero wave number. 

So far, € is an arbitrary function of time, and œ; is an unrelated arbi- 
trary constant 4 x 4 matrix. But these are just gauge modes for zero wave 
number. On the other hand, if they can be extended to non-zero wave 
number they become physical modes, since the choice of Newtonian gauge 
leaves no residual gauge symmetries except for zero wave number. For 
the tensor modes there is no problem; in this case there are no field equa- 
tions that disappear for zero wave number, so the solution with Dj time- 
independent automatically has an extension to a physical mode for non-zero 
wave number. But matters are more complicated for the scalar modes. 

For the scalar modes the field equation (5.3.20) disappears in the limit 
of zero wave number, so to get a physical mode we must impose on the 
perturbations the condition 


=v, (5.4.12) 


(The condition ŝu = e that is required for Eq. (5.3.21) to be satisfied for 
q # 0 is already satisfied, according to Eq. (5.4.9).) Inserting Eq. (5.4.8) in 
(5.4.12) gives a differential equation for e: 


è = —He 40473 (5.4.13) 


Also, Eq. (5.4.8) for Y and Eq. (5.4.9) for ôu give the quantity R defined by 
Eq. (5.4.1) the time-independent value 


R = @kk/3 . (5.4.14) 
There is a general solution of Eq. (5.4.13) for e(t) with R 4 0: 


t 
e(t) = =| a(t’) dt’ . (5.4.15) 
a(t) Jr 


with 7 arbitrary. Using Eq. (5.4.15) in Eq. (5.4.8) gives the explicit solution 
(5.4.4) for large wavelengths 


H(t) f! 
V=O=R |- + a a(t’) ar | ; (5.4.16) 
a(t) JT 
Eq. (5.4.9) gives 
ô ô R f! 
ee ea -5f a(t’) dt’ , (5.4.17) 
pD P a(t) JT 
and more generally for any four-scalar Eq. (5.4.10) gives 
ô R f! 
SA | ajde. (5.4.18) 
s alt) JT 
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Because for any T, Eq. (5.4.15) isa solution of Eq. (5.4.13) with the same 
value of R, the difference of two of these solutions with different values of 
T is also a solution, but with R = 0. For this solution 


C 
= — 4.1 
e(t) Fite (5.4.19) 
with C another constant, and 
CH(t) 
Y = 6 = —— 5.4.20 
aD” ( ) 
L E T (5.4.21) 
p p 5 a(t) 


and likewise for individual constituents of the universe. (Since a(t) increases 
and H(t) decreases with time, this is a decaying mode, which is usually 
assumed to play no significant role at late times.) Note in particular that 
these scalar modes have equal values for 8px / Py for allindividual constituents 
a of the universe, whether or not energy is separately conserved for these 
constituents. For this reason, such perturbations are called adiabatic.’ 
For the same reason, any other solutions are called entropic. (Sometimes 
such other solutions are called zero-curvature modes, but this is misleading, 
because setting ® = Y = 0 does not usually give a solution at all.) 

We have shown that for sufficiently small wave number (in practice, this 
means q/a < H) there are always two adiabatic physical solutions for scalar 
perturbations that take the form (5.4. 16)—(5.4.18) and (5.4.20)-(5.4.21), with 
R and C arbitrary constants, anda physical solution for tensor perturbations 
that takes the form (5.4.11) with w; an arbitrary constant matrix. Since the 
equations we have solved are homogeneous, it follows that there are also 
solutions of the same form for which R, C, and wj are arbitrary time- 
independent functions of q. This concludes the proof.® 


5For instance, if several constituents (such as an electron—positron plasma, and photons) each have 
a density and pressure that depends (even when perturbed) only on the temperature, then 5p (T) = 
pi (T)5T and dõa(T)/dt = pl,(T)T, so ôpa/a = 6T/T, and likewise for pressure and any other 
scalars. 

©The existence of solutions with ôsq/ $ equal for all four-scalars s such as energy densities, pressures, 
etc. (but not the detailed solutions (5.4.4)-(5.4.7)) seems to have been generally accepted for a long time. 
An intuitive “separate universe” argument for the existence of solutions for which dpaq/ py are equal 
for all constituents a of the universe has been given by D. H. Lyth and D. Wands, Phys. Rev. D 68, 
103516 (2003); also see D. Wands, K. A. Malik, D. H. Lyth, and A. R. Liddle, Phys. Rev. D 62, 043627 
(2000); A. R. Liddle and D. H. Lyth, Cosmological Inflation and Large-Scale Structure (Cambridge 
University Press, 2000). This reasoning has been extended beyond perturbation theory by D. H. Lyth, 
K. A. Malik, and M. Sasaki, J. Cosm. & Astropart. Phys. 0505, 004 (2005) [astro-ph/0411220]. But this 
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This theorem shows that if the perturbations produced during inflation 
are actually in the modes found above at the end of inflation, then they stay 
in these modes as long as the perturbations remain outside the horizon, and 
in particular R, and D, remain constant, since these modes are solutions 
of the equations whatever the constituents of the universe may become. In 
particular, the wavelengths that will interest us are far outside the horizon 
during the era of reheating that is supposed to follow inflation, so if the 
scalar perturbations are adiabatic at the end of inflation, then reheating 
cannot generate entropic perturbations.’ 

The question that is left unanswered by this theorem is whether the scalar 
perturbations produced during inflation are actually in these modes at the 
end of inflation. This is often a matter of counting. We know that there 
are always two independent adiabatic solutions of the differential equations 
governing the scalar fluctuations, so if these equations have no more than two 
independent solutions, then any perturbations must be adiabatic. As we will 
see in Chapter 10, this counting shows that all solutions are adiabatic for 
inflation with a single inflaton field. 

But it must not be thought that if observation of the cosmic microwave 
background reveals purely adiabatic perturbations at the time of last 
scattering, then the perturbations must have been adiabatic at the end of 
inflation. In a state of complete local thermal equilibrium in which all 
conserved quantum numbers vanish (such as is usually assumed to have 
existed at some early time in theories of cosmological baryonsynthesis or 
leptonsynthesis, like those discussed in Section 3.3) there are only two scalar 
degrees of freedom, the temperature and the gravitational potential ® = W. 
They are governed by coupled first-order ordinary differential equations, so 
there are just two independent solutions, which must be adiabatic since there 
are always at least two adiabatic solutions. Thus whatever happens during 
inflation, if the universe subsequently spends sufficient time in a state of local 
thermal equilibrium with no non-zero conserved quantities, then the per- 
turbations become adiabatic, and they remain adiabatic subsequently, even 
when the conditions of local thermal equilibrium are no longer satisfied.® 

The use of conservation laws to connect different cosmological eras is 
not limited to Newtonian gauge. Indeed, both ¢, and Rq can be put in 


sort of argument only shows that there is a solution satisfying this condition for zero wave number. As 
we have seen, there are indeed many such solutions for zero wave number, most of which have no physical 
significance because they cannot be extended to finite wave number. The proof presented here shows 
that the requirement that the solution can be extended to finite wave number yields just two solutions, 
described by Eqs. (5.4.4)-(5.4.7). It is this requirement that makes it necessary for the infinitesimal 
redefinition of the time coordinate, used in the “separate universe” argument to generate the solutions 
for zero wave number, to be accompanied with an infinitesimal rescaling of the space coordinate. 

7S. Weinberg, Phys. Rev. D 70, 043541 (2004) (astro-ph/0401313). 

8s. Weinberg, Phys. Rev. D 70, 083522 (2004) (astro-ph/0405397). 
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a gauge invariant form. It is only necessary to remark that according to 
Eqs. (5.3.13) and (5.3.14), the quantities 4/2 — Hép/p and A/2 + Hôu are 
gauge invariant, and that they reduce in Newtonian gauge to the quantities 
(5.4.2) and (5.4.1), respectively, so in any gauge 


bq = Aq/2 — Hdpq/p , Rq = Aq/2+ Hôuq , (5.4.22) 


provided of course that A, 60, and ŝu are all calculated in the same gauge. 
For instance, in synchronous gauge Eq. (5.3.38) gives 


gq’ Aq = 81 Ga"5pq — 2Ha Wg , (5.4.23) 


so in this gauge ¢, and R, can be expressed in terms of the convenient 
gravitational variable y4 by 


tg = -a° H Y + 40 Ga? êp — qH ôpq/ Ò, 
qRaq = -a° H Yy + 4r Ga" 5pq + q’ H8uq , (5.4.24) 


of course with the understanding that dp, and dug are here calculated in syn- 
chronous gauge. This result can also be derived directly from the Newtonian 
gauge formulas (5.4.1) and (5.4.2) by using the rules for transforming from 
Newtonian to synchronous gauge given in the previous section. 

There is a convenient general formula for the rate of change of ¢, and Ry 
that holds for g/a « H whether or not the cosmological fluctuations are 
in an adiabatic mode. To derive this formula, we use the Newtonian gauge 
energy-conservation law (5.3.24), which for g/a « H gives 


Sbg + 3H (5pq + bpq) = 3(P + P) Wy , (5.4.25) 
and the corresponding unperturbed conservation law 
= -3H (5 +P). (5.4.26) 
It is then a matter of simple algebra to calculate that for q/a « H 


: PSPq — PSPq 
bq B+ D2 | (5.4.27) 
For g/a « H, the same formula then also gives Rq. It should be noted 
that according to Eq. (5.3.14) the quantity on the right-hand side is gauge- 
invariant, so we can check whether or not ¢, and Rq are time-independent 
by evaluating pôpq — PSpq in any gauge. 

In particular, if for arbitrary perturbations p + dp is a function F (Ø + 6p) 
of the perturbed energy density alone, then p = F' (8) and dpg = F'(p)dpq, 
so the terms in the numerator of the right-hand side of Eq. (5.4.27) cancel, 


253 


5 General Theory of Small Fluctuations 


and so ¢, and Ry are conserved outside the horizon, even if the fluctuations 
are not in an adiabatic mode.? This is the case for instance if all abundant 
particles are highly relativistic, in which case the total perturbed pressure is 
1/3 the total perturbed energy density, or if they are highly non-relativistic, 
in which case the total perturbed pressure is negligible. On the other hand, 
the theorem proved earlier in this section tells us that ¢, and Rq are conserved 
outside the horizon in the adiabatic modes, even in cases (such as inflation 
with a single scalar field) for which the perturbed pressure is not given as a 
function only of the perturbed energy density. 

The proof of Eq. (5.4.27) depended only on the energy conservation 
equations (5.4.25) and (5.4.26), not on the gravitational field equations. 
Therefore, if there is a constituent a of the universe that is energetically 
isolated, in the sense of not exchanging energy with the rest of the matter 
and radiation of the universe, then for g/a « H the quantity 


faq = —Wy T Paq /3 (Pa + Pa) » (5.4.28) 


calculated using the energy density pg and pressure pa of the isolated con- 
stituent, satisfies the gauge-invariant relation!’ 


$ PudPagq = PadPagq 
Cag = BCA ap) ; 
(Pa + Pa) 
In particular, fy, (and hence also Rag) is conserved outside the horizon if 
the perturbed pressure of the isolated constituent is a function only of its 
perturbed energy density.!! (As we will see in Section 6.1, this is actually a 
good approximation for each of the individual constituents of the universe 
at times after the temperature dropped below about 10!° K, when neutrinos 
and antineutrinos no longer had significant interactions with matter and 
radiation.) In this case, Eq. (5.4.28) can be regarded as a convenient for- 
mula for the fractional density fluctuation outside the horizon in Newtonian 
gauge 


(5.4.29) 


Paq 
3 (Da + Pa) 


with ¿aq time-independent. 


= aq + Wy > (5.4.30) 


’Ifthe total pressure is a function only of the total energy density, then the three first-order differential 
equations (5.3.21), (5.3.23) and (5.3.24) form a closed set of equations for Y = ®, du, and dp, so with 
the constraint (5.3.26) there must be just two independent solutions for these quantities, which therefore 
must take the adiabatic form (5.4.4) and (5.4.6), or (5.4.7). But the complete solution is not necessarily 
adiabatic, because other four-scalars (such as the energy densities or pressures of individual components 
of the universe) may not be given by Eq. (5.4.9). 

10, Bashinsky and U. Seljak, Phys. Rev. D 69, 083002 (2004) [astro-ph/0310198}. 
11D, Wands et al., ref. 2. 
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There is a corresponding result in synchronous gauge: For long wave- 
lengths the energy conservation equation (5.3.34) for an energetically isolated 
component of the universe takes the form 


Siig + 3H (piq + ôplq) + (a+ Pu) ya =0, (543D 


the superscript s denoting synchronous gauge. If also po is either negligible 
or a function only of pa, then this can be written as 


d à Sbig 
slae) <M 


Paq 
Pa + Pa 


so that 


2 J Wg dt + Cag - (5.4.32) 


The integration constants cag depend on what we take as the lower limit on 
the integral of yq, but their differences have an absolute significance, and 
are simply related to the differences of the ¢y,. The gauge transformation 
equation (5.3.14) and the equation fg = —3H (fa + Pa) for separate energy 
conservation shows that for any two components of the universe whose 
energy is separately conserved, we have 


5%, 50% pa 5 
SURGES, gg, OEE gs SAN, SORE (5.4.33) 
Pat Pa Pp+PB fPatPa ppt Pp 
Comparing Eqs. (5.4.30) and (5.4.33), we see that 
Cag — Bq = 3 (Gag 4 cpa) . (5.4.34) 


In the special case in which the perturbations are adiabatic, according to 
Eq. (5.4.9) the quantities ŝpoq/ (Pa + Pa) = —3H ôpaq/ fy are all equal, and 
hence the fyq are all equal, and in fact equal to ¢,. According to Eq. (5.4.34), 
the cag will then also be all equal. The cosmic microwave background can 
be used to measure the differences of the Cag Or Cag before perturbations 
re-enter the horizon, and hence to decide whether the cosmological fluc- 
tuations are truly adiabatic. So far, as we will see in Section 7.2, it seems 
they are. 
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Evolution of Cosmological Fluctuations 


We will now apply the formalism developed in the previous chapter to work 
out the evolution of cosmological fluctuations, from a temperature +10? K 
when electron positron annihilation is substantially complete and neutrinos 
have decoupled from matter and radiation, down to the relatively recent 
time when the matter fluctuations become too large to be treated as first- 
order perturbations. Our results will be applied to analyze the observed 
fluctuations in the cosmic microwave background in the next chapter, and 
large scale structure in Chapter 8. 

For reasons discussed in Section 5.4, the connection between the evo- 
lution of fluctuations in this era and what happened at earlier times will 
appear in a few parameters including Rg (or ¢,), which are conserved dur- 
ing the many e-foldings of expansion when the perturbations to the various 
constituents of the universe were still outside the horizon — that is, when 
the physical wave number g/a was much less than the expansion rate å/a. 
The values of these parameters when outside the horizon thus characterize 
the strength of the various perturbations. In Chapter 10 we will see what 
can be understood about these parameters from a study of the much earlier 
era of inflation, and in this way work out what large scale structure and 
fluctuations in the cosmic microwave background can tell us about the era 
of inflation. 

Section 6.1 presents the equations governing scalar perturbations. These 
equations take a simple hydrodynamic form for cold dark matter and the 
baryonic plasma, but for calculations of high accuracy it is necessary to 
use the Boltzmann equations of kinetic theory to follow the detailed dis- 
tribution of photons and neutrinos in phase space.! These equations are 
too complicated for an analytic treatment; that is a task for comprehensive 
computer programs such as CMBfast” and CAMB. Unfortunately such 
computer programs do not lend themselves to an exposition aimed at an 


1P. J. E. Peebles and J. T. Yu, Astrophys. J. 162, 815 (1970); R. A. Sunyaev and Ya. B. Zel’dovich, 
Astrophys. Space Sci. 7, 3 (1970). 

2The original code for the Boltzmann equations for photons and neutrinos and the other dynamical 
and gravitational equations on which CMBfast is based was written by C.-P. Ma and E. Bertschinger, 
Astrophys. J. 455, 7 (1995) [astro-ph/9506072]. An important element discussed below was added 
by U. Seljak and M. A. Zaldarriaga, Astrophys. J. 469, 437 (1996).[astro-ph/9603033]. Also see 
M. Zaldarriaga, U. Seljak, and E. Bertschinger, Astrophys. J. 494, 491 (1998); M. Zaldarriaga and 
U. Seljak, Astrophys. J. 129, 431 (2000). The program is available on the website www. cmbfast . org. 

3This program is based on CMBfast. It was written by A. Lewis and A. Challinor, and is available 
at camb.info/. 
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understanding of the physical phenomena involved. Therefore in subse- 
quent sections we will present hydrodynamic calculations that are simple 
enough to be done analytically, aside from a few numerical integrations, 
and yet realistic enough so that the results are a good approximation to the 
more accurate results of computer programs. 

The general equations and initial conditions for our analytic treatment 
of scalar modes are given in Section 6.2. Our analytic treatment of scalar 
modes then divides into the study of two wavelength regimes: wavelengths 
long enough to have come within the horizon during the matter-dominated 
era, to be considered in Section 6.3, and wavelengths short enough to have 
come within the horizon during the radiation-dominated era, considered in 
Section 6.4. Section 6.5 will show how to interpolate between the solutions 
found for these long and short wavelengths. Section 6.6 treats the evolution 
of tensor perturbations. 

In Chapter 7 the results of this chapter for both scalar and tensor modes 
are applied to the anisotropies and polarization of the cosmic microwave 
background. Chapter 8 takes the treatment of matter perturbations beyond 
the time of last scattering, with results applied to observations of the cosmic 
distribution of matter. 


6.1 Scalar perturbations — kinetic theory 


It seems highly likely that from the beginning of the period of interest here, 
from just after et — e~ annihilation at a temperature T ~ 10? K, down 
to the time of last scattering when T ~ 3,000 K, the universe consisted of 
just four components: photons, cold dark matter, neutrinos, and a bary- 
onic plasma consisting of free electrons, ions, and neutral atoms. In this 
section we will consider the perturbations in scalar modes to each of these 
four constituents in turn, adopting for this purpose the synchronous gauge 
described in Section 5.3. Each perturbed quantity X (x, t) (such as dp, dp, 
A, B, etc.) is written as a Fourier integral and a sum over modes, as in 
Eq. (5.2.1): 


X (x,t) = Efe an (q) Xng(t) e!4™ , (6.1.1) 


where œn (q) is the stochastic parameter for the n-th mode. In particular, the 
metric perturbation in synchronous gauge is given by ôgoo = 0, dgi9 = 0, 
and 


ôg D =a Y J Pq anO) [An Oy ~ qij Ba d|, (6.1.2) 
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(Note that q is the co-moving wave number, related to the physical wave 
number k by k = q/a. It is common to define a so that a = | at the 
present time, so that the co-moving wave number equals the present value 
of the physical wave number, in which case the co-moving wave number is 
often denoted k. We will instead leave the normalization of a arbitrary, and 
reserve the symbol k for the physical wave number q/a.) In this section we 
will consider any one mode, dropping the label n; the equations we find will 
have a number of solutions, which define the various modes. 


Cold dark matter 


The individual cold dark matter particles are assumed to move too slowly 
for them to produce an appreciable pressure or anisotropic inertia, and, as 
shown in Section 5.3, the absence of pressure or anisotropic inertia allows 
us to adopt a particular synchronous gauge in which the cold dark mat- 
ter fluid velocity uly vanishes. That is, the coordinate mesh is tied to the 
dark matter particles in such a way that they remain at rest despite fluc- 
tuations in the gravitational field in which they move. Cold dark matter 
is therefore characterized solely by a total density pp(t) + dpp(x, t), with 
the unperturbed density p(t) simply decreasing as a~>(t), and the Fourier 
transform of the perturbation 5pp(x, t) governed by the equation (5.3.34) 
of energy conservation with zero pressure and velocity 


5PDq + 3H dp pq = —ppWq : (6.1.3) 
(Recall that yy = (34g — q°By)/2.) 


Baryonic plasma 


The Coulomb interactions of electrons and atomic nuclei are sufficiently 
strong so that they act together as a single perfect fluid. In the era of 
interest both electrons and nuclei are highly non-relativistic, so the baryonic 
plasma has negligible pressure and anisotropic inertia, and therefore the 
unperturbed density g(t) goes as a~3(t), and the Fourier transform of 
the density perturbation 5,(x, t) is governed by the energy-conservation 
equation (5.3.34), but now with a non-zero velocity potential dug(x, t): 


5 pq + 3H dppq — (q7/a”) Ppdupg = -BY - (6.1.4) 


On the other hand, Thomson scattering allows the baryonic plasma to 
exchange momentum with photons, so it is the combination of photons 
and plasma that satisfies the equation (5.3.32) of momentum conservation, 
which here reads 


4_ 
Spy q— 91>, + [o + 3H] [Prus + 3 Py buy a| =0, (6.1.5) 


259 


6 Evolution of Cosmological Fluctuations 


where dp, and du, are the pressure perturbation and velocity potential of 
the photons. We can regard this as the equation of motion of the baryonic 
plasma, with the photonic quantities ne Pyq, SUygq calculated as described 
in the next subsection. 


Photons 


Well before the time of recombination the density of free electrons was high 
enough so that photons could be described hydrodynamically: Thomson 
scattering gave the photons a total momentum locked to that of the bary- 
onic plasma, so that du, 4 = dug,, and a momentum estabuHon that was 
isotropic in the co-moving frame, so that dp, 4 = dpy 4/3 and n3 q = 0. But 
for a highly accurate treatment of photons around the time of recombination 
it is necessary to treat them kinetically, studying the distribution of photons 
in momentum space, and taking account of photon polarization. As dis- 
cussed in Appendix H, this distribution is an Hermitian number density 
matrix n? (x, p, t), defined so that the number of photons in a space volume 
Į[; dx’ with momenta in a momentum-space volume ||; dpi, weighted with 
the probability of their having polarization e’, is eje; *nÜ (x, p, © J [y dx*dpg, 


and p;nł = 0. (The polarization vectors satisfy iie normalization condi- 
tion gye/e/* = 1 and the transversality condition pje’ = 0, and e; = gige“. 
For further discussion of polarization, see Appendix G.) For small perturb- 
ations, this distribution is conveniently written in the form 


ik il 
n! (x, pt) = shy (amp, p, D) es ‘i= a(x, Og" (x, pez 


[p° (x, p, 1? 
+6n! (x, pd), (6.1.6) 


P(x p, © = J gÏ (x, Opip; . (6.1.7) 


Here ï, (p) is the equilibrium phase space number density 


where 


yD) = [exp (pko to) -a 6a 


with T (t) the unperturbed temperature of the baryonic plasma, and én’ is a 
small intrinsic perturbation. The first term in Eq. (6.1.6) is just the distribu- 
tion matrix for unpolarized photons in thermal equilibrium at temperature 
T(t), written in a general spatial coordinate system, so ôn includes the dyna- 
mical rather than the purely geometric effect of metric perturbations on the 
photon distribution. (Note that the factor a(t) in the argument of n, in 
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Eq. (6.1.6) is canceled by the factor a(t) multiplying T(t) in Eq. (6.1.8); this 
factor is introduced because in the era of interest T(t) x a~!(t), so that as 
we have defined it, n, is a time-independent function of its argument.) 

Since the sum over polarizations of ee; is gj, the phase space den- 
sity of photons is gin! . Hence the total energy-momentum tensor of the 
photons is 


3 
l jP“ Pv 
Ths a | l | an gijn” é (6.1.9) 
Y i 0 
/ Det g 4 p 


The first term in Eq. (6.1.6) contains first-order perturbations arising from 


the metric perturbations in p? = gp; and p? = —po = ,/g‘/pip;, in the 
metric determinant Detg, and in the factor gj in gin! . It is straight- 
forward though tedious to show directly that all these contributions to 
ôT”, cancel, but this can be seen more easily by noting that Eq. (6.1.9) 
shows that the contribution of the first term in Eq. (6.1.6) to 57),;(x, t), 


ST) (x,t), STyo(x,t), and 8TYo(x,t) are local functions of the 
three-metric g;;(x, t) that transform under general spatial coordinate trans- 
formations as a mixed three-tensor, a covariant three-vector, a contravariant 
three-vector, and a three-scalar, respectively. But there are no non-trivial 
local functions of the three-metric that transform in this way under spatial 
coordinate transformations! This leaves only the contribution from the trace 
of bn}. That is, the total first-order perturbations to the mixed components 
of the energy-momentum tensor are 


J 
kPk 


i l : Pip 
87,700 = ap J (Ian) a(t) nk (x, p, t) z (O10) 
1 3 
ST yi(X 0) = za | (11 ax) a(t) 5n\* (x, p, 1) Dj » (6.1.11) 


1 3 
5TYolx, t)= -a5 | (11 in) a’ (t) Sn (x, p, t) VPkrPk - (6.1.12) 
k=1 


Evidently, all we need to calculate these perturbations is the integral of 
snik (x, p, t) over photon energy, weighted with a single factor of energy. We 
therefore introduce a dimensionless intensity matrix Ji(x,p,t), 
defined by 


AO by OI. =a) f snipp DA dp, (6.1.13) 
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where p,(t) = a~t (t) f Arp i, (p) dp is the unperturbed photon energy 
density. The components (6.1.10)—(6.1.12) then become 


Tij) = yO) io P IilXsb.0 Piby (6.1.14) 

STX, t) -a00 f E — Tk (X,p, 0) Pj » (6.1.15) 
dp 

ST yYo(x, t) = —Ay (0) T Tiel, pt). (6.1.16) 


As shown in Eq. (H.37) of Appendix H, the perturbation ôn? is governed 

by the Boltzmann equation 
ddn(x,p,t) pe adn (x,p,t) , 2a(t) 
at a(t) axk a(t) 


bn! (x, p, t) 


— gag” POPPI, © (Ogui, D) (8y — Bib) 


P 3welt A 
= —we(t) ôn” (x, p, t) + ol) [em 
8x 
x | bn! (x, ppi, 0) — Bide 8n (x, pôi, D — PjPr 8n” (x, pir, 
RK Bee kl a 
+ PiPjPKPI ôn (x, pôi D| 
- $5 pxdupe it, (p) (8y — PiP;) (6.1.17) 
Walk Bk ly, ij — PiPj} > oh 


where now p = ./pipi, Pp = p/p; w(t) is the collision rate of a photon with 
electrons in the baryonic plasma; and dug, is the peculiar velocity of the 
baryonic plasma. We can derive a Boltzmann equation for the dimensionless 
intensity matrix by multiplying Eq. (6.1.17) with 4zp? and integrating over 
P = \/Pipi, using 


4x f pin, (p) dp = -167 Í pP ï, (p) dp = —4a*(t)p, (0) . 
0 0 


Writing the intensity matrix and plasma velocity as Fourier transforms 


Jy (x, p, t) = d’q Jy(q, P, t) efx 5 ôĉupk (X, t) = d’q ôupi(q, t) ela x 
J y 
(6.1.18) 
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we find 
aJ; (q...) -q 


— Jy (q, p, t 
a eio EP? 


+a(q) [40O = a ABO] (8y — Aib) 


^ 3we(t) x 
= —we(t) Jj (q, P, t) + a [em 


x Face t) — Pik Jri (q, P1. O — PiPK Jik (q, P1, £) 


+ DiPjPKPi Jri (q, P1. n| 
2we(t) 
a(t) 


in which we have used Eqs. (5.3.27) and (6.1.2) for the metric perturbation. 

The intensity matrix and plasma velocity are proportional to the stochas- 
tic parameter a (q) for whatever mode is under consideration, which contains 
all information about initial conditions. Apart from this factor, there are no 
preferred directions in the problem, so the coefficient of w(q) in the intensity 
matrix can be decomposed into a sum of terms proportional to the two sym- 


metric three-dimensional tensors 6; — pp; and (à —(ĝ- DP) (ù —(q- DP) 
that vanish when contracted with p; or p;, with coefficients that depend on 


the directions ĝ and p only through the scalar product ¢-p. This decompos- 
ition is conventionally written as 


[sy = PiP;| ÎÊkõugk(q, t) , (6.1.19) 


A 1 S A A S A A ROR 
Jj P. t) = alq) (APG. -p= AP aap: D) (3 - pip) 


(a: - @- pp) la-a DP) 


+A9 q, å- P, D) A . (6.1.20) 


(The subscripts T and P stand for “temperature” and “polarization.” Note 
that the trace J;;, which is all that appears in the energy-momentum tensor, 
is proportional solely to AN , but we need to keep track of AO because it 
is linked to AY through the dynamical equations.) Similarly, the integral 
over p appearing on the right-hand side of Eq. (6.1.19) may be expressed in 
terms of a pair of “source functions” ®(q, t) and TI (q, t) as 


dp : La 
| fava. t) = a (q) EZ t) + 7G, J ; (6.1.21) 
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and as usual we write 
Supi(q, t) = ior(q)gidup g(t) . (6.1.22) 


Inserting Eqs. (6.1.20)—(6.1.22) in Eq.(6.1.19) yields the coupled Boltzmann 
equations for A and AY ), 


.( Ib 
Aq, mt) +i (£) Aq, u, D = (DADE, u, t) 
3 
+ zeel) (l -— DNQ, À , (6.1.23) 
AO .( Le \ AS) _ (S) 
r (Gh +i a(t) Ar (q, 4, t) = —wc(t)AF (q, HD 


— 2Ag(t) +20 u BA) 


+ 3we(t) ®(q, t) + Felt) — uI (q, t) + 4iquwe(t)ôug q(t), 
(6.1.24) 


with ® and TI defined by Eq. (6.1.21). 
The usual approach‘ to the solution of these Boltzmann equations is 
through an expansion of Ae and Ao) in partial wave amplitudes: 


CO 

AP, u, Ð = YOE +1) PW ANG (6.1.25) 
£=0 
CO 

AM Gt) =i Ql+ DPW ALIGN. (6.1.26) 
£=0 


To derive the Boltzmann equations for the partial wave amplitudes, we use 
the recursion and normalization relations for Legendre polynomials 


+1 2 
(2641) uPo(u) = (C41) Pegi (u) +EP), J Pw du= x5 


4M. L. Wilson and J. Silk, Astrophys. J. 243, 14 (1981); J. R. Bond and G. Efstathiou, Astrophys. J. 
285, L45 (1984); R. Crittenden, J. R. Bond, R. L. Davis, G. Efstathiou, and P. Steinhardt, Phys. Rev. 
Lett. 71, 324 (1993); C. -P. Ma and E. Bertschinger, Astrophys. J. 455, 7 (1995). 
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Multiplying Eqs. (6.1.23) and (6.1.24) by Pe(u) and integrating over u then 
gives 


ACS) O nO 
Ape +t aago t DAR E 


1 5 
= -0A + soll (so d 2), (6.1.27) 


(S) (S) (s) 
Ar Deen aye Arey — Are ‘| 


l . Sw 28 
= —2A45¢9 + 24b ($ z 2) 


1 1 
= woh, + We (3 + sn) ogo + Ta 602 


4 
— 34M SUB g Sel (6.1.28) 
We can express the source functions in terms of AY q and AP) by inserting 


Eq.(6.1.20) in Eq. (6.1.21), and using the integral oria, “that for any 
function f (ĝ - P), 


a zl LP. G- PPiPi = Ady + Bd » 
where 


1) i 
A=3f aufwd-w)= 2 f dus (Pow - Paw) 
—1 -1 


1 f! 1 f! 
za J ) du f (w) Bu? — 1) = 5 J ; du f(u) Pa(u) 
(The general form of the integrals is dictated by rotational invariance, while 
the formulas for the coefficients are found by contracting the integral with 
6, and with ĝ;ĝ;, and then solving the resulting pair of linear equations for 


A and B.) Using this, we can easily evaluate the two terms in the integral 
(6.1.21) (here dropping the arguments q and t): 


l EEOAE O 2 
o | (aS (q-p)— Ap aD) (ô; — Pipi) 


-f a (AP aw) (3P (Ore: w) a- o 
st, AE p OUT T O NA 
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ae BAOG 5) (4 qi — | oe q -p)p;) 


1 1 
-f du aol (a Po(u)t+> 5P) byt (4 Po(uw)—5 Paw) iis | 


In this way, and using the partial wave expansions (6.1.25) and (6.1.26), we 
find 


it fee, seh. a ats 
an Paro - Apo- AF- A] : (6.1.29) 
T= Apo + AF) + Apa. (6.1.30) 


In the same way, using Eq. (6.1.20) in Eqs. (6.1.14)-(6.1.16) gives 
1 
TiD = BH f Pg era f du AP) 
(p DOE ROO 
x =( o(u) — 2(1)) ij + 5 2()diqj | » 
a pl 
THK) =a A) | Paa f dua Pand, 
ST? ZA 1 = 3 iq:x : (S) 
yox, t) T —5Pr® d q e a (q) AT (q, Ms t) Polu) ’ 
in which we have used the formula 
l i hos eR E E 
o | Pira D= 1f aufa. 


Comparing this with the first three of Eqs. (5.1.43), and again using the 
partial wave expansions (6.1.25) and (6.1.26), we find 


pya = Z (a$ Apta), (6.1.31) 
EAN T (6.1.32) 
ee Spy ys (6.1.33) 
qty q = SNA À (6.1.34) 


As a check of Eqs. (6.1.31)-(6.1.33), note that when used in the last of 
Eqs. (5.1.43) these results give ôT is , = 0, a necessary consequence of the 
masslessness of the photon for any distribution of photon momenta. 
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Eqs. (6.1.31)-(6.1.34) show that to work out the cosmological evolu- 
tion of the gravitational field and its effect on other perturbations in scalar 
modes, all we need to know about photons is nS for € < 2. But of course 
the evolution of these three amplitudes is coupled by the Boltzmann equa- 
tions (6.1.27)-(6.1.30) to both AY) and A> for all higher £. In computer 
programs like CMBfast and CAMB, the Serial wave expansion is cut off 
at a sufficiently high value of £; in the latest version of CMBfast, the max- 
imum value of £ is taken as ¿max = 12, in which case the computer has 
to solve 2(€max + 1) = 26 coupled ordinary differential equations for each 
value of g, not counting the other equations that describe the evolution 
of the baryonic plasma, cold dark matter, neutrinos, and the gravitational 
field. 

As we will see in the next chapter, the interpretation of observations of 
the cosmic microwave radiation background requires calculation of AY 


and A for £ ranging to values well over 1,000. Originally this was done 


by a direct use of the coupled Boltzmann equations (6.1.27) and (6.1.28), 
but this required hours or even days of computer time for each theoretical 
model. A great improvement was introduced with the suggestion to use 
instead a formal solution of the Boltzmann equation (6.1.19), in the form 
of a “line-of-sight” integral,’ which in matrix notation takes the form 


t t dt" t 
Jii(Q,P, D = aQ) J dt’ exp (-ia-2 f — -f dt" ot) 
ti t a(t ) t 


x | — (8% — ibs) (Aq) - O-b) 


3@(t') 
z 2 
3we (t’) 
4 


(dy — PiE (q, t’) 


+ TI (q, Nhà- — Îi ĝ - B)) (% — Pi (q- D) 


w(t’) Reade 
ae) [dij — PPA PrdUK(, l 


+ Jj (q, Pp, t1) exp (-ia of 4 — ‘at Oc(t ve (6.1.35) 
ti at) 


t 


where f; is any arbitrary initial time. If we choose t; to be sufficiently 
early, before recombination, so that we(t,) >> H (tı), and take ¢ at any time 


5U. Seljak and M. A. Zaldarriaga, Astrophys. J. 469, 437 (1996). [astro-ph/9603033]. 
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after recombination, the final term in Eq. (6.1.35) may be neglected, and 
we have 


t t dt" t 
Jj (QP, n=aw | dt’ exp (-i4-2 f —- | dt" ott") 
t t a(t") a 


x | ~ (8y = Bibj) (40) - @ -ba )) 


3welt’) ek 
+ e (êj — pidj) P(g, t’) 
3a¢(t’) / A A GA A A A ya A 
+ 4 na h(t- BiG -P) (4-7-8) 
2o,-(t' Beste ete 
=o i ~ pid] PrSuk (A, | | (6.1.36) 


In terms of the temperature and polarization amplitudes defined by the 
decomposition (6.1.20), the line-of-sight solution reads 


t t dt" t 
AP asst) = AG, D2 | dr exp [~ion ea) vara 
ti t € 


. , 2iuqoe(t , 
x [aa + 12a? By lt) +500) Batya OO su | 


a(t’) 
(6.1.37) 
3 ' / s dt” i 1 H 
AiG u, t) = ql = u’) J dt’ exp [ian f at") -f W(t’) dt | 
ti t Y 
x w(t) Tq, t’), (6.1.38) 


where du, is the scalar velocity potential, defined by ôu (q, t) = iqkuq(t). 
Once ® and IT have been calculated from Eqs. (6.1.29) and (6.1.30), 
Eqs. (6.1.37) and (6.1.38) can be used to calculate As ey and AP for 
arbitrarily high values of £. (It is also possible to use Eq. (6.1.36) as a 
substitute for the partial wave expansion in calculating the source terms ® 
and II. Integrating Eq. (6.1.36) over p yields integral equations: expres- 
sions for ®(q, t) and TI (q, t) as integrals from ft; to ¢ in which the integrand 
is a linear function of ®(q.t’) and TI (q, t’) for t < t. This approach will be 
applied to tensor modes in Section 6.6.) 


6s. Weinberg, Phys. Rev. D 74, 063517 (2006) [astro-ph/0607076]; D. Baskaran, L. P. Grishchuk, 
and A. G. Polnarev, Phys. Rev. D 74, 063517 [gr-qc/0605100]. 
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Neutrinos 


The number density n(x, p,t) of each species of massless neutrinos (or 
antineutrinos) in phase space can be conveniently expressed in terms of an 
intrinsic perturbation ôn, (x, p, t) by a formula like Eq. (6.1.6): 


ny (Xx, p, t) = ny, (ap, p, D) + ôn, (X, p, t), (6.1.39) 


where 7, is the equilibrium phase space density of each neutrino species 


fn) = Ge att [exp (piksa?) +1] A (6.1.40) 


As shown in Appendix H, the perturbation 67,,(x, p, t) satisfies the same 
Boltzmann equation as the photon phase space density a? (t) nee (x, p, 2), 
except that for T « 10!°K, the terms proportional to the collision frequency 
are absent: 


06n,(X,p,t) Ôk 3n (X, p, t) 
at a(t) axk 


0= 


1 PEE E E 
A OOE) (6.1.41) 


The contribution of each species of neutrino to the perturbations to the 
energy-momentum tensor is given by formulas (6.1.10)—(6.1.12), except that 
ôn, (X, p, t) appears instead of a? (t) snk (x, p, t). Once again, all we need for 
this purpose is a dimensionless direction-dependent intensity, defined by a 
formula like Eq. (6.1.13): 


a*(t)py (NI (x,p, 1) = Ny I wenn dp , (6.1.42) 
0 


where N, is the number of species of neutrino, counting antineutrinos 
separately, and 5, = N,a~* f 4x p>, (p) dp. Then the total neutrino and 
antineutrino contribution to the energy-momentum tensor is 


STi, D = pol) ee CP 1¢x,8, Pi 5 (6.1.43) 
8T? (x, D = al) pv) je EP ig POD; (6.1.44) 
ST, t) = —py(t) ie ERAON (6.1.45) 
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just as in Eqs. (6.1.14)-(6.1.16). Rotational and translational invariance 
allow us to express J (x, p, t) as a Fourier integral of the form 


J(x,p, t) = J ala) AP (q, å- P, De® d?q (6.1.46) 


To derive a Boltzmann equation for AS we multiply Eq. (6.1.41) with 
47x |p|? and integrate over |p|, and find 


a AP (q, u, t) S 2 2 
+i P AO, u, t) = —24g(t) + 2 ub) . (6.1.47) 
ot a 
In computer programs like CMBfast, the Boltzmann equation for neu- 
trinos as well as for photons is solved by a partial wave expansion. One 
writes 


AS (4g, u, 1) = D + DP) AS, D) . (6.1.48) 
£=0 


Inserting this in Eq. (6.1.46) and then in Eqs. (6.1.43)-(6.1.45) and then 
comparing the results with the first three of Eqs. (5.1.43) gives the perturbed 
pressure, scalar anisotropic inertia, perturbed energy density, and velocity 
potential of the neutrinos 


spra = ČP (ag.0+ aq). (6.1.49) 
PRO) = POAIGD, (6.1.50) 
dpv q(t) = Pv (t) A (q,t) , (6.1.51) 
q buy q(t) = A (q,0). (6.1.52) 


To derive the Boltzmann equations for the partial wave amplitudes, we 
multiply Eq. (6.1.47) with Pe(j) and integrate over u: 


q 
AN G9 + Tap ape t DARD ~ LA G9] 
26 45 
= -24030 + Pho (8-52) . (6.1.53) 


In the current version of CMBfast, this equation is cut off at a maximum 
value of £ equal to 25. Instead of relying on a truncated partial wave expan- 
sion, it is possible to write a solution (here not merely a formal solution) of 
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Eq. (6.1.47) as another line-of-sight integral: 


AMG, u, t) = -2 J peer J ' a) (49) = wab) 
v oh ; Y a(t") q q 


t 


t 1 
+A (q, u, t1) exp (~ian J a) (6.1.54) 

ti a(t’) 
where tı is any arbitrary initial time. If tı is taken at some time after the 
neutrinos went out of thermal equilibrium with the baryonic plasma, but 
early enough so that gravitational field perturbations have not yet had a 
chance to distort the neutrino distribution, then A) (q, p, t1) arises only 

from the temperature perturbation at time t1: 


8Ty(t1) ed 
T(t) alti) |’ 


AS, u,t1) =4 | (6.1.55) 


in which the second term in square brackets arises from the Doppler effect 
due to the streaming of the electron—positron—photon plasma. 


The integrals over direction in Eqs. (6.1.43)-(6.1.45) can be done ana- 
lytically; for this purpose we need the formulas 


d.aa n i. . 
<P 5 DidKbLe PY — ( Sijôkr + dic di + Siôr ) ja(v)/v™ 
4r 


= (j ôkÂr + Sig bib) + 510.0; + Si bib] + 1040; + Skid) j3(v)/v 


+ ôd; dk j4(v) , (6.1.56) 
PPan a ip ae : hae. 

J Jy DiPiPe PY = =] (di + 56) + Suid )i2(0)/ v 
+ 10; 0; Ôk j3 (v) ; (6.1.57) 
AEE ERE PE 
Zn lP! e = yf (v)/v — iV ja (v) (6.1.58) 
dp A —ip-v ^ > 
—p;e PY = —iùi ji (v) (6.1.59) 
An 
ap s 

J qe =j), (6.1.60) 
4r 

where v = |v|, and je(v) is the usual spherical Bessel function. Again 


comparing the results with the first three of Eqs. (5.1.43), we obtain explicit 
formulas for the perturbed pressure, scalar anisotropic inertia, perturbed 
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energy density, and velocity potential of the neutrinos: 


t 
ôPv q(t) = -250 f dt' 
ti 
TE t dt" E t dy! 
x | Agr Bi (af fz) B4 (t) Ko g a) (6.1.61) 
t 
Prp) = —2py (0) J dt 
t 
; ee t dt" 2 t dt" 
a [Aut )j2 (af i5) — q“ B4 (t) K3 (af a) (6.1.62) 


t 
Ane J dt’ 
an 


f Mg t dt" sn 3 t dt" 
x [Auton (af aa) Olaf ah E 


t 
dpy g() = -250 | dt' 
ti 


a Taped t dt" es t dt! 
X aut ) Jo gi =) — q“ By(t') Ks (af 5)! (6.1.64) 


where 
Ki(v) =fji)/v, (6.1.65) 
Kaw) = hw) —ja(v)/v, (6.1.66) 
K3(v) = —2j2(v)/v* + 5 0)/v — ja) , (6.1.67) 
Kav) = 3j2(v)/v — ja) , (6.1.68) 
Ks(v) = fi(v)/v — jou) . (6.1.69) 


Using these formulas, one no longer needs the truncated partial wave 
expansion for neutrinos. 
Gravitation 


It only remains to give the equations of motion for the scalar gravitational 
field components A and B. (Note that A and B themselves are nowhere 


7 These formulas are given also for massive neutrinos by S. Weinberg, ref. 6. 
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needed.) It is convenient to take one of these as Eq. (5.3.31), written as 
ae va = —47 Ga? 
ðt 


x (Sen, + dpBq + Spy q + Pvg + 36Py q + 35Pvg — qm i = Gr) 


(6.1.70) 
where 
bq = 5(34y = qb) ! (6.1.71) 
The other can be taken from Eq. (5.3.30) 
A= 80 Gl $y Buy $ $ Suduva + prdupg| . (6.1.72) 


(Recall that we have adopted the particular synchronous gauge in which 
dupq = 0.) After solving the first-order differential equation (6.1.70) for Yg 
and using Eq. (6.1.72) to find Aq» the other component is trivially given by 
the definition Eq. (6.1.71) as 


qg Bg = 344 — 2Wq « (6.1.73) 


With this, as long as we truncate the partial wave expansions used for 
photons and neutrinos, we have a closed system of ordinary differential 
equations for all the perturbations, which can be straightforwardly solved 
by computer for any given initial conditions. 

To find initial conditions, we note that at a sufficiently early time f1, well 
before the era of recombination, (say, for T (t1) > 10° K), the collision rate of 
photons with the baryonic plasma is so great that photons are in thermal and 
kinetic equilibrium with the plasma. Under these conditions, the photon 
distribution ôn? arises only from a perturbation to the temperature in the 
first term of Eq. (H.31), including the Doppler shift due to the photon 
streaming velocity, which in equilibrium is the same as the baryonic plasma 
velocity dug: 


i l E T(x, tı) Pkõupgk(X, t1) 
ôn? (x, p, t1) = E (ny [84 - bib | Hp ee 


T(t) a(tı) 
(6.1.74) 


(The factor 1/a(tı) in the Doppler term in Eq. (6.1.74) was explained in 
connection with Eq. (H.13).) Note that for w, very large, the coefficients of 
œc on the right-hand side of equation (6.1.17) must cancel, which gives an 
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initial condition consistent with Eq. (6.1.74). Multiplying with 4xp?° and 
integrating over p, we find a corresponding condition on Jj: 


(6.1.75) 


7 na] [OT (Kt) | pedug(x, tı) 
Jij(X, P, t1) = 2| 8% ~ ĝi; | | É | 


T (ty) a(tı) 


Note that Jj(x, p, ti) receives contributions only from scalar and vector 
perturbations, not from tensor perturbations. 

Similar remarks apply to neutrinos, except that we must go back to 
an earlier time, when the temperature was a little below 10!° K, so that 
neutrinos were already traveling freely, but not enough time had elapsed for 
the gravitational field perturbation to have altered the equilibrium form of 
the neutrino phase space distribution. 

This still leaves us with the necessity of stipulating initial values for Aj, 
By, Sdq> 5Bq, 5UBg, and 5T,. For this, we must go back to a time early 
enough so that the wave numbers of interest were outside the horizon, in 
the sense that g/a « H. In the following section the needed initial values 
will be worked out for the dominant adiabatic mode, with a normalization 
expressed in terms of the quantity Rq, given in synchronous gauge outside 
the horizon by Eq. (5.4.24). 


6.2 Scalar perturbations — the hydrodynamic limit 


The system of equations described in the previous section is much too 
complicated to allow an analytic solution. Fortunately, until near the 
time of recombination the rate of collisions of photons with free elec- 
trons was so great that photons were in local thermal equilibrium with 
the baryonic plasma, and so photons at these times can be treated hydro- 
dynamically, like the plasma and cold dark matter. This approach loses its 
validity around the time of recombination, but a fair degree of accuracy 
will be preserved in Section 6.4 by taking into account the damping caused 
by the growing mean free times in this era. After the time of recom- 
bination photons traveled more or less freely, and their path can be fol- 
lowed by solving their equation of motion. Neutrinos are more of a 
problem, but at very early times perturbations were outside the horizon, 
so at these times q°7 $, and qu, q were negligible and Eqs. (6.1.49)-(6.1.51) 
show that dp,, = 3ôpvq, just as if the neutrinos were in local thermal 
equilibrium, while at late times the universe became matter dominated 
and neutrinos made only a small contribution to the cosmic gravitational 
field. 
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With these justifications, in order to allow an analytic treatment, in this 
and the next three sections we will adopt a hydrodynamic approach.! To 
be specific, for the most part we will neglect anisotropic inertia, take du, q 
equal to dugy, and take py g = pyq/3 and pyg = Pyq/3. This necessarily 
entails the loss of some accuracy, but our aim in this chapter (and in the 
next two chapters) is not to calculate the course of cosmic evolution and 
its observational consequences with the high level of accuracy that would 
optimize the extraction of cosmological parameters from the latest data 
on the cosmic microwave background and large scale structure. Rather, 
we wish here to elucidate the physics of cosmic evolution, and clarify the 
dependence of observables on cosmological assumptions. Fortunately, the 
results we obtain from this analytic treatment will turn out in Chapter 7 to 
yield predictions for anisotropies in the cosmic microwave background that 
are quite similar to those obtained by comprehensive computer programs, 
using the full Boltzmann equations described in the previous section. We 
are thereby reassured that the hydrodynamic approach captures the essence 
of what is going on in the early universe. 

Under the above assumptions, the Fourier transforms with co-moving 
wave number q of the synchronous gauge perturbations are governed by the 
gravitational field equation (5.3.36): 


d/ 2 
Fla Yq) = —4r Ga? (8pp4 + 8084 + 28pyq + 28pvq) » (6.2.1) 
the equations (5.3.34) of energy conservation for each of the four fluids 
Sbyq + 4Hôpyq — (4g? /3a7) By Suyg = —(4/3)pytqg, (6.2.2) 
Sp + 3Hôpp4 = -PDY , (6.2.3) 
beq + 3H Sppq — (1/4) gsuyq = -ÖB Y4 » (6.2.4) 


bvg + 4Hôpvg — (4q7/3a’) pySttyg = —(4/3) vg, (6.2.5) 


and the equations (5.3.32) of momentum conservation for the photon- 
baryon plasma and the neutrinos: 


d (/4_ : 4_ E 
A (Ga + pa) 4) +3H (Ga + A) ya) = —(1/3)5pyq , (6.2.6) 


d /- Z 


lOther analytic or semi-analytic treatments of the evolution of fluctuations have been given by W. Hu 
and N. Sugiyama, Astrophys. J. 444, 489 (1995); 471, 542 (1996); V. Mukhanov, Int. J. Theor. Phys. 
43, 623 (2004) [astro-ph/0303072]. The treatment given here is in my opinion more transparent though 
somewhat less accurate than that of Hu and Sugiyama, and (because we allow ourselves the use of a 
computer to do numerical integrals) it is more accurate than that of Mukhanov. 
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It is very convenient to rewrite these equations in terms of the dimen- 
sionless fractional perturbations” 


Paq 
“1 Beit Dox 
where œ runs over y, D, B, and v. Taking into account that a*p,, a> Ap, 
a? pp, and afp, are all time-independent, Eqs. (6.2.1)-(6.2.7) now read 


d _ _ 8 _ 8 _ 
Fhe) = —47 Ga? (Av. + PBoBq + =Pyôyq + SPd) » (6.2.9) 


3 

byq — (Q /@ suya = —q » (6.2.10) 
nq = -V4 > (6.2.11) 
ÔBq = (q° /a7)5uyq = —Wq > (6.2.12) 
vq = (q /a*)bUyq =—-Wy > (6.2.13) 

d (A+ R)buyg 1 
= —— ô .2.14 
dt ( a 3a yq > (6 ) 

d (buUyg 1 

ae = —— ôq » 6.2.15 
dt ( a ) 3a "4 ( ) 


where R = 3øg/4p,. Eqs. (6.2.1)-(6.2.7) or (6.2.9)-(6.2.15) form a closed 
system of seven first-order differential equations for yq, the four density 
perturbations, and the plasma and neutrino velocity potentials, so there 
must be seven independent solutions. 

Before trying to find solutions valid up to the time of recombination, we 
must first consider the initial conditions to be imposed. These initial con- 
ditions will distinguish the different independent solutions. At sufficiently 
early times the universe was in a radiation-dominated era, when py < ØR, 
where 


PM = D + ps, PR= py t+ Pv, (6.2.16) 


so that to a good approximation a œ /t and 87Gfr/3 = 1/417, while 
R < 1. (This fixes our definition of the zero of time.) If we take a x Vt 
and R < 1, Eqs. (6.2.9)-(6.2.15) become 


3 3 
byq = Bg = —Wq + (GQ? /a°)bttyq , (6.2.18) 


d g g 8. 8. 
Thiva) SAG (Av. ont origa Prd) =. 6217) 


2Note that this differs from a commonly used convention, according to which 5¢q would be defined 
as dpag/Pa- 
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Dq = —-Wq (6.2.19) 
vq = — Wq + (GV? /a?)8Uyq » (6.2.20) 
2 He 1) = = 6.2.21 
at\ ft Say ee (6.2.21) 
d (bUyg 1 
7 = —— by, . v2 2? 
lea an Sree 


At very early times the perturbation was outside the horizon, in this sense 
that g/aH < 1, but we have not yet dropped the terms in Eqs. (6.2.18) and 
(6.2.20) proportional to q?, because in some modes there are cancelations 
in the calculation of the conserved quantity R, outside the horizon that 
require us to take such terms into account. Also, we have not dropped 
the terms on the right-hand side of Eq. (6.2.17) proportional to pp or pz, 
because even though we are now assuming that pp and pg are much less 
than p, and p,, we want to leave open the possibility of modes in which the 
fractional fluctuations in the dark matter and/or baryon density are much 
larger than the fractional fluctuations in the photon and neutrino densities. 


Mode 1 


This is the dominant adiabatic mode — adiabatic, in the sense that all the 
dag become equal at very early times. (As discussed in Section 5.4, only these 
modes are present in inflationary theories with a single scalar field, or if the 
universe was ever earlier in a state of complete local thermal equilibrium 
with no non-zero conserved quantities.) Inspection of Eqs. (6.2.17)-(6.2.22) 
shows that, if we make the ansatz, 


ôyq = Bq = ÔDq = vq = ôq, — SUyg = Uyg = Ug , (6.2.23) 


and if we now drop the baryon and dark matter-dominated terms in 
Eq. (6.2.17), and consider times early enough so that we can drop the q? /a? 
terms in Eqs. (6.2.18) and (6.2.20), then Eqs. (6.2.18)—-(6.2.22) and (6.2.17) 
become 


bg = — Yy ,» (6.2.24) 
d (dug 1 
eed Ps = > 52.2) 
dt (=) 3./t ae oid 
and 
d 1 
Sheva) = -ő . (6.2.26) 
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Combining Eqs. (6.2.24) and (6.2.26) gives a second-order differential equa- 
tion for ôq: 


This has two solutions, with 6, « ¢ and 6, œ 1/t, and for each solution 
Eqs. (6.2.24) and (6.2.25) give solutions for Yq and du,. The growing solu- 
tion has 6, « t and dug X t, and gives our first adiabatic mode: 


PEPR? 
yq = Ong = vq = Dq = EA (6.2.27) 
t R? 
Peys E 1 (6.2.28) 
2P R? 
bUyg = bUyg = mmga š (6.2.29) 


We have normalized this mode so that the quantity given by Eq. (5.4.24) as 
Q Rq = —@ Hq + 41 Gasp + qH ôug (6.2.30) 


takes the time-independent value PRE for g/a &« H (the superscript o 
standing for “outside the horizon”). 


Mode 2 


The solution of Eqs. (6.2.24)-(6.2.26) which goes as ôg « 1/t for t > 0 gives 
us our second adiabatic solution: 


2 2€q 


(6.2.31) 


with €, an arbitrary time-independent function of q. The calculation of 
Ry for this solution has the problem that the first two terms in Eq. (6.2.30) 
for q’Rzq cancel to zeroth order in q°/a?H?, leaving us with an unknown 
residue in PRq of order q?°/a?H?, and hence an unknown term in Rq 
of zeroth order in g?/a*H?. Fortunately, in this mode we can find a sol- 
ution to Eqs. (6.2.17)—(6.2.22) that is valid to all orders in g/aH, as long as 
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PM X PR: 


€ 2 
A AE yp Se hy ny = [ =e = In (=) , (6.2.32) 


t 3H? a? 
Eq OL ad 
w= [ +55 | , (6.2.33) 
2 
EL EE =! (6.2.34) 


Using this in Eq. (6.2.30) shows that this mode has R4 = 0 to all orders in 
q/aH as long as pm & pr. 

The other five modes are non-adiabatic, in the sense that some of the dug 
are unequal even for g/a «< H. One particularly simple mode can serve as 
an illustration: 


Mode 3 
€ PB €gPD 
ôD = ees hee > ôB R a (6.2.35) 
s PB + PD a PB + PD 


again with eq time-independent but otherwise arbitrary. This just amounts 
to a perturbation in the time-independent ratio of the densities of baryons 
and dark matter, and is an exact solution for all times. It is an isocurvature 
mode, in both the sense that R, = 0, and also that y, = 0. 


x k k 


As already indicated, these results apply only at times early enough so 
that Om < pr (and, for mode 1, q/a « H). As an aid to extending these 
early-time solutions to later times, note that for all times before recombin- 
ation, the difference of Eqs. (6.2.10) and (6.2.12) gives 


© (5q ~ 8,4) =0. (6.2.37) 


We see that any solution that satisfies the adiabatic condition, that 63, = dy,4 
at early times, when the perturbation is far outside the horizon, will continue 
thereafter to have 

dB = Sbyq - (6.2.38) 
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Eqs. (6.2.9)-(6.2.15) are then reduced to 


d 8. 8 _ 
Fhe) = —47 Ga? [Povu + (i + sr) Sygt S Boug , (6.2.39) 
byq — (Q /@ suya = -Y4 » (6.2.40) 
ông = -V4 » (6.2.41) 
bug = (q /a7)5tyq =-VW> (6.2.42) 
d (A+ R)du 1 
i ( a r) = -zzôyg > (6.2.43) 
d (du, 1 
dt ( wt) = — 3,54 > (6.2.44) 


where again R = 30g/4pr. These equations apply to modes | and 2 at all 
times before recombination. We will be chiefly interested in mode 1, since 
the perturbations of mode 2 decay by a factor 1/¢? relative to those of mode 
1 during the part of the radiation dominated era when the perturbation is 
still outside the horizon. 

It is not possible to find an analytic solution of even the reduced set 
of equations (6.2.39)—(6.2.44) that would be valid for all times and wave 
numbers. They can be treated analytically, however, in two wavelength 
regimes: long wavelengths, for which q « qrq, and short wavelengths, for 
which q > gga, where gEQ is the wave number for which g/a = H at 
matter—radiation equality. Recall that, once inflation is over, g/a decreases 
more slowly than H, so that for long wavelengths, the wave number is so 
small that pm becomes equal to or when g/a is still much less than H, 
while for short wavelengths, the wave number is so large that q/a becomes 
equal to H when py, is still much less than or. For long wavelengths we 
will be able to find analytic solutions of Eqs. (6.2.39)-(6.2.44) in both the 
early era, when the perturbation is outside the horizon, and in the later 
era, when the expansion is dominated by non-relativistic matter, and patch 
them together in the era of overlap, when the perturbation is still outside the 
horizon and the universe is already matter dominated. Conversely, for short 
wavelengths we will be able to find analytic solutions of these equations in 
the early era when the universe is radiation dominated and in the later era 
when the perturbation is deep inside the horizon, and patch them together 
in the era of overlap, when the universe is still radiation dominated and the 
perturbation is already deep inside the horizon. The two cases of long and 
short wavelength are considered in Sections 6.3 and 6.4, respectively. 

To calculate the critical wave number greg for which g/a = H at 
matter—radiation equality, we recall that the redshift of matter—radiation 
equality is given by Eq. (2.2.5) as 1+z£Q = Qu /QR = Qyh?/4.15 x 1075. 
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Eq. (2.2.9) gives the Hubble rate during the radiation dominated era as 
H =2.1 x 107% (1 + z)?s7!, but at radiation—matter equality the contri- 
bution of matter to the total energy density makes H larger by a factor /2, 
so for fluctuations that just enter the horizon at matter—radiation equal- 
ity the physical wave number and Hubble rate at that time are given by 
Eq. (2.2.9) as 


qrQ/anqg=HEqg=V2 x2.1x 1077? (1+zgQ) s7! =1.72x107!! (Qyh?)? s7! . 
This corresponds to a critical physical wavelength at present given by 


_ 27 = 2a(1 + ZEQ) 
~ EQ/a0 GEQ/AEQ 


do = 85 (Qyh’)"| Mpc. (6.2.45) 


For comparison, the size of the local supercluster, estimated from the 
distance between our galaxy and the Virgo cluster, is about 15 Mpc. 
Perturbations that are now observed to extend over distances that are larger 
or smaller than Agg came within the horizon before or after matter—radiation 
equality, respectively. Using the present mass density omo given by 
Eq. (1.5.28) and (1.5.39), the average mass now contained within a sphere 
of diameter Ag is 

T 


7 pmoay = 0.9 x 10! (Qyh?)~? Mo . (6.2.46) 


This may be compared with the mass of a large galaxy, about 101? Mo. 
Thus any perturbation that is relevant to the formation of galaxies or even 
clusters of galaxies would have been well within the horizon at the time of 
radiation-matter equality. 

We can also identify a corresponding critical multipole order £gQ of 
anisotropies in the cosmic microwave background. As remarked in Sec- 
tion 2.6, the integral over wave numbers for the multipole coefficient Cge is 
dominated by co-moving wave numbers of order £/rz, where rz is the radial 
coordinate of the surface of last scattering, and hence the integral for C¢ is 
dominated by wave numbers of order qgo that just come into the horizon at 
matter—radiation equality if £ is of order gq = qerz. This can be written 


qEQ \ ( 2EQ \ ( 4 HeQ(1 + Zz) 
LEO = arre, = ———€Q IL , 
= ee ( ao ) & CE S 
where az is the Robertson—Walker scale factor at last scattering. We recall 


that Heo = V/2Qu (1 + 28Q)?/7Ao, and azr = da, the angular diameter 
distance of the surface of last scattering. Also, 1 + zeg = Qy/QrR. Hence 


LEQ = QuV2/QrAod4,0 + ZL). (6.2.47) 
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For instance, if we take sample parameters Q yy = 0.26 and Q, = 0.74, then 
as noted in Section 2.6 d4 = 3.38Hy '(1+zz)7!. Taking Qpr = 8.01 x 1075 
(corresponding to To = 2.725K and Ho = 72 km sec~! Mpc7!), Eq. (6.2.47) 
gives a critical multipole order £go = 140. Multipole coefficients for larger 
values of £ arise only from perturbations that entered the horizon during 
the radiation dominated era. 

In what follows, for both scalar and tensor modes,we will find it conve- 
nient to introduce a dimensionless rescaled wave number, 


[2q _ (Q/R _ q/a0 


Eq Hom 0.052 2h? Mpc! ` 


(6.2.48) 


In the cases of long and short wavelength, we have x « 1l andx > 1, 
respectively. 

The calculations of the next three sections are necessarily complicated 
and perhaps tedious. As a guide, it may help to say that the results at which 
we are aiming are Eqs. (6.5.15) and (6.5.16) for the perturbations to the dark 
matter density and gravitational field in the whole of the matter-dominated 
era, and Eqs. (6.5.17) and (6.5.18) for the perturbations to the photon and 
baryonic plasma density and velocity potential in the matter-dominated era, 
up to the time of the decoupling of matter and radiation. 


6.3 Scalar perturbations — long wavelengths 


We first consider perturbations with wavelengths that are long enough so 
that they are still outside the horizon at the time of radiation—matter equality. 
As discussed at the end of the previous section, such perturbations are 
responsible for multipole moments of the cosmic microwave background 
anisotropies with £ < 140. Because gpr/aHpy is constant during the 
radiation dominated era, when a «x Jt and H œ a~?, and we are here 
assuming it is much less than one when py = ØR, it follows that for these 
wavelengths we have i 
q PM 
aH < PR 
throughout the radiation-dominated era. (Recall that ðm = Øp + Øg and 
PR = Py + py.) On the other hand, during the matter-dominated era when 
a x ÊP and H « a`’, it is q°5r/a? H? py that remains constant, and 
since this quantity is assumed to be much less than one when py = pr, for 
these wavelengths we have 


(6.3.1) 


2 = 
fa eM, (6.3.2) 


a? H? OR 


throughout the matter-dominated era. 
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We cannot give a single analytic formula for these perturbations during 
the whole era from just after electron—positron annihilation until near the 
present, but fortunately we can find analytic solutions in two eras: first, 
the era when the perturbations are outside the horizon, and, second, the 
era when the energy density of the universe is dominated by non-relativistic 
matter. (The anisotropic inertia due to neutrinos can be neglected in both 
eras, because it is negligible outside the horizon, and irrelevant when the 
energy density of neutrinos is much less than that of matter.) For the long 
wavelengths considered in this section, that are still outside the horizon at 
the time that the matter density becomes equal to that of radiation, these 
eras overlap. This allows us to take the initial condition in the second era 
from the results for the first era in the period in which they overlap. 


A. Outside the horizon 
The perturbations of greatest interest, corresponding to adiabatic modes, 


are governed by Eqs. (6.2.39)-(6.2.44), with the fractional perturbations dy 
defined by Eq. (6.2.8), and with dg = 5,9. For g/a < H, this gives 


d : 8. 8. 
ales) = —47 Ga? [Pvn + (2 ih sr Bra i hb (6.3.3) 


3 3 
byq = bq = bq = —Wq (6.3.4) 
d (A+ R)éu 1 
ai (ae) = 3574 . (6.3.5) 
d (duy 1 
7 ( wt) = — 3-504 (6.3.6) 


These can be solved analytically, at least for some of the modes described 
in the previous section. Most importantly, for the adiabatic solutions with 
all dug equal outside the horizon, Eqs. (6.3.3) and (6.3.4) become 


ae on 
5 (4 va) = —4n Ga Au F se ôq» (6.3.7) 
5S Sigs (6.3.8) 
where 
En an en eee (6.3.9) 


and we recall that py = pg + Pp and pr = py + py. Inserting Eq. (6.3.8) 
in (6.3.7) then gives a second-order differential equation for ôq: 


8 
g (ea) = 4r Ga? au + Sø] g » (6.3.10) 
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To solve Eq. (6.3.10), it is very convenient to replace the dependent variable t 
with y = a/agQ = Pm/PR, Where apo is the Robertson—Walker scale factor 
at matter—radiation equality. Then py = peQ/y? and pr = pEQ/y*, where 
PEQ is the common density of matter and radiation when they are equal. 
Using the Friedmann formula for the expansion rate, we have then 


a OV EY A (6.3.11) 
dt J y d’ 


and Eq. (6.3.10) becomes 


d d 3 8 
1 — 1 ôg | — ôg = .3.12 
yv FS (ov rA :) TODE 0 (6.3.12) 
This has two independent solutions, 


6 =y7(16+ 8y — 2y? + y°), SO =y? V/1+Fy. 


We are looking for a solution that, as found for Mode 1 in the previous 
section, vanishes like t œ y? for t > 0, so we must take our solution to be 
proportional to 6 — 166), which for y —> 0 approaches 5y?/8. Adjusting 
the normalization of this solution to match Eq. (6.2.27), we have then 


49° Re By 
a Soap (16+ 8-2 +y - 16/1F7), (6.3.13) 
5H2 ok! 
V2y a ae 
5 a 6.3.14 
CE BEE CATR) 6 ray oe ies 
Vy P dy 
Sipe 80’), 6.3.15 
Uvq 3H Jo JEY qQ) ( ) 
VaR 
k aa (iF (32+ 8y- P) -6 -48y), (6.3.16) 
Hol 


With this normalization factor, R, takes the time-independent value RI 
outside the horizon. 


B. The matter-dominated era 
We can also solve Eqs. (6.2.39)—(6.2.44) analytically in the matter-dominated 


era, when Om >> pr, whether or not the perturbation is outside the horizon. 
For simplicity, we will also assume that pg < pp. (Their ratio is actually 


284 


6.3 Scalar perturbations — long wavelengths 


about 0.2.) However, since p, is also much less than pp in the matter- 
dominated era, we will not assume that pz is negligible compared with p, in 
this era. Recall that in the matter-dominated era! a œ t?/3 and 82 Gpy /3 = 
H? = 4/917. If we keep only the term in Eq. (6.2.39) proportional to fp, 
but now make no assumption about the relative magnitude of q/a and H, 
then Eqs. (6.2.39)-(6.2.44) become 


L (Ay) = -Eran (6.3.17) 

yq — (7/8) 8uyq = -Vq (6.3.18) 

bug — (q?/@) Suva = —Wa (6.3.19) 

bg = -W4 » (6.3.20) 

T (PA + Ruyg) = =ar iz (6.3.21) 
T (3109) = =f tng (6.3.22) 


where once again R = 3pp/4pr & a. 

There are two solutions to Eqs. (6.3.17) and (6.3.20), one with yg œ 1/3 
and ôpq « 17/3, the other with wy « t7? and ôpg œ t~}. To evaluate the 
coefficients of these solutions in mode 1, we must compare these results with 
those given by Eqs. (6.3.13) and (6.3.16) in the era where both sets of results 
apply, the era (which exists because of our assumption of long wavelength) 
when both g/a « H and pr « py, 1. e., y > 1. Eqs. (6.3.13) and (6.3.16) 
give in mode 1 


4q°yRi 9 ORG 


Sp = , 6.3.23 
PT SH2otzg 10a? g 
-2420 R9 3q tR? 
Wy = soe eae (6.3.24) 
SHEQ4EQY / Sa 


(The final expression in Eqs. (6.3.23)-(6.3.24) is derived using the result 
that for fr < pp, the Hubble rate is H = 2/3t = Hpg/V2y?/*, and 
eliminating Hgo.) For a œ 1/3, these match the solution with ôpg œ 17/3 
and Wy x t—!/3, while it is easy to see that Mode 2 gives the other solution 
in the matter-dominated era, the one with Yq « t? and ôpg œ t71. We 
conclude that if Mode 1 is dominant outside the horizon, then the dark 


! The zero of time is chosen in this subsection so that in the matter-dominated era a œ 17/3. In other 
words, ¢ could here be defined as 2/3H. 
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matter density perturbation and gravitational field perturbation are given 
in the matter-dominated era by Eqs. (6.3.23) and (6.3.24). 

This is a good place to pause, and make contact with the more elementary 
treatment of Section 2.6. Under the assumption that the fluctuation in 
the Newtonian gravitational potential (x,t) at around the time of last 
scattering is dominated by fluctuations in the dark matter density, its Fourier 
transform is given by using Poisson’s equation, with the Laplacian replaced 
with —g?/a: 


ô$ D = -4r G(?(0)/4") Spp(q,1) = -4r G (0/4) oO - 


Using the Mode 1 solution Eq. (6.3.23) and the Friedmann equation H? = 
(2/3t)* = 82 Gpp/3, we see that ô¢ (q, t) takes the time-independent value 


3 
ôġ (q) = -300R : 


Then (5¢(q)6¢(q')) = Pg (q)53(q + q), with the correlation function Po 
introduced in Section 2.6 equal to 


9 
Po) = zlRal - 


The behavior Pg(q) = Nog? that was found in Section 2.6 to yield a 
temperature multipole coefficient Cy = 87 N 3 /9€(€ + 1) thus corresponds 


to the assumption that IRE? = N*q-3, with Nb = 9 N? /25. 

Even if we assume that the gravitational field is dominated by cold dark 
matter, we must still consider the perturbations to the photon density and 
the photon—baryon velocity as preparation for calculating the contribu- 
tion of intrinsic temperature fluctuations and Doppler effect to the cosmic 
microwave background anisotropies in the next chapter. Equations (6.3.18) 
and (6.3.21) are a pair of coupled inhomogeneous first-order differential 
equations for the two unknowns 4,, and éu,,, with a forcing term propor- 
tional to Yq. Surprisingly, with yq given by Eq. (6.3.24), there is a simple 
exact solution of these equations: 


242 o 3-20 
wa 3q + 3RR D 3E RG 


_ = “6335 
Y1 5a?(t?q?/a* + 2R) “v4 5a? (t?q?/a + 2R) Á ) 


(We will not be concerned with the remaining equations, (6.3.19) and (6.3.22), 
which are needed only to calculate the neutrino perturbations.) 
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To the particular solution (6.3.25), we must add a suitable solution of 
the homogeneous version of Eqs. (6.3.18) and (6.3.21), 


d/_ le 
by = (P/a Sug > = (i 7/31 + RuG) = ee 
(6.3.26) 


or, eliminating the velocity potential, 


2 
“(i (: AUER Zag) + te =0, (6.3.27) 


with coefficients chosen so that 50 + so matches the solution found earlier 
outside the horizon, when q?t?/a? < 1. 

Using the fact that in the matter dominated era R œ a œ t’, we can find 
a general solution of Eq. (6.3.27) as a linear combination of the functions 


1 1 


1 1 l 
r3- ztqvi-i&, 5, -R) , 
l 3 1 3 
VRF (3 -iV +y 16 5. -R). 


where F is the Gauss hypergeometric function (also known as 2F1), and n 
is the quantity 


~ 4a2R? 


which is time-independent during the matter-dominated era. Unfortu- 
nately, this does not provide much insight into the behavior of the solutions. 
Instead, at this point we will make the further assumption that the wave- 
length is only moderately long, in the sense that n > 1. That is, although q 
is small enough so that g/aH < 1 at matter—radiation equality, we assume 
that it is sufficiently large so that throughout the matter-dominated era we 
have 


R<«tq’/a K pm /pr. (6.3.28) 


Each term in this inequality is proportional to £? during the matter- 
dominated era, so if Eq. (6.3.28) holds at any time during this era, then 
it holds throughout it. This assumption will allow us to find solutions 
of Eq. (6.3.27) in terms of elementary functions. In the opposite case of 
extremely long wavelengths, for which £q? /a? « R throughout the matter- 
dominated era, we can solve the general solution of Eqs. (6.3.18) and (6.3.21) 
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as a power series in the quantity 17q7/a7R. We are more interested here in 
the case of moderately long wavelengths, because as we will see in Section 
6.5, it is this case that can be connected to the case of short wavelengths by 
a smooth extrapolation. 

For perturbations satisfying the inequality (6.3.28), when the perturba- 
tion is outside the horizon and for some time after it re-enters the horizon, 
we will have R « 1. During the period when R < 1, the homogeneous 
equations (6.3.26) have the exact solution: 


82) = cq cos(W3qt/a) + dy sin(V3qt/a) , (6.3.29) 
yg = 5 [-c sin(v3qt/a) + dy cos(v3qt/a)| , (6.3.30) 


with cg and d, constant. To this, we must add the inhomogeneous solution 
(6.3.25), in the limit R « q?t?/a?: 


3(1 + 3R)RG Ste 


dd) _ (1) _ q 
Ong = 5 : buy paid ar (6.3.31) 


We can evaluate the constants cq and d, by requiring that for gt/a « 1 
(which according to Eq. (6.3.28) also implies that R « 1), the total photon 
density perturbation 3’) + 5) must approach 3py > 9¢°tR?/10a*. This 
gives 


SRG 
Cq= ers d,=0, (6.3.32) 
so that, as longas R < 1, 
ZR? 
ss sa cos(W3qt/a) , (6.3.33) 


Eventually R becomes non-negligible, but under the assumption (6.3.28), 
by then gt/a will be much larger than one, and we can solve the homoge- 
neous equations (6.3.26) using the WKB approximation. Inspection of 
Eq. (6.3.27) suggests that for gt/a >> 1 the density fluctuation will oscillate 
rapidly, with phase 


_ f qat J3qt 
o=] mae = aq in(VR+v1FR). (6.3.34) 


Using ọ as the independent variable instead of t, and recalling that during 
the matter-dominated era a « 17/3, Eq. (6.3.27) becomes 


@Syqg 1 (din +R)\ dôyq 
dọ 2 do do 
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We try for a solution of the form Ae*'?, with A varying slowly with ¢, 
so that we can neglect the second derivative of A in the first term on the 
left and neglect the first derivative of A in the second term on the left, 
which is already small because of the factor dIn(1 + R)/dg. This gives 
dA/dg ~ —(A/4)d In(1 + R)/dọ, and hence the general WKB solutions of 
Eq. (6.3.27) are 


BC oc (1+ R) "4 exp(ig) . 


Clearly, the linear combination of these solutions that merge smoothly with 
the results for R « 1 is obtained by replacing the argument of the cosine 
in Eq. (6.3.33) with g, and multiplying with (1 + R)~!/4. Adding the inho- 
mogeneous term (6.3.25), the total photon and baryon fractional density 
perturbations for moderately long wavelengths in the matter-dominated 
era are 


3R? 
ôq = ôm = aA [1 +3R -= (1 +R" cos ¢| (6.3.35) 


We can then use Eqs. (6.3.18) and (6.3.24) to calculate the velocity potential 


3R 


a 
bUyg = ——~ | —1 + sin | ; 
Me | Bua FRA Y 


(Here we neglect a term in square brackets of order Ra? /t°q?.) As a check, 
note that early in the matter-dominated era, when gt/a « 1, we have 
R < 1,809 > V3qt/a > 0, and hence Eq. (6.3.35) gives 5g = Bq > 
9¢°t?R?/10a*, in agreement with Eq. (6.3.23) and the condition that for 
adiabatic modes all dy, are equal outside the horizon. This condition is sat- 
isfied by Eq. (6.3.35) even if the inequality (6.3.28) is not satisfied, as long 
as both R « land gt/a<« 1. 

The results (6.3.35) and (6.3.36) for 5,7, = dg, and ôu,q in the case of 
moderately long wavelengths all apply only up to the time of last scatter- 
ing. On the other hand, to the extent that the energy density after last 
scattering is dominated by dark matter, 5p, and wy are unaffected by the 
decoupling of radiation from the baryonic plasma, and continue to be given 
by Eq. (6.3.23) and (6.3.24) until either vacuum energy and possibly spatial 
curvature become significant or the perturbations become too strong to be 
treated as first-order perturbations. 


(6.3.36) 


6.4 Scalar perturbations — short wavelengths 


We next consider adiabatic perturbations with wavelengths that are short 
enough so that they are already well within the horizon at the time of 
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radiation—matter equality. As discussed at the end of Section 6.2, such per- 
turbations are responsible for multipole moments of the cosmic microwave 
background anisotropies with £ >> 140, and also for the onset of gravita- 
tional condensations that lead to the formation of structures on the scale 
of galaxies or clusters of galaxies. Following the same reasoning as at the 
beginning of the previous section, for these wavelengths in the radiation- 
dominated era 


1 ny M 


; 6.4.1 
aH” PR ( ) 
and in the matter-dominated era 
iD) a, 
q PM 
ae 6.4.2 
a? H2 d OR ( ) 


where as before pm = pp + pg and pr = py + py. 

As shown in Section 6.2, because we are considering adiabatic pertur- 
bations, the fractional perturbations dyy = 5paq/(Pa + Pa) are subject to 
the condition 5g, = 5, g. Consequently the perturbations are governed by 
Eqs. (6.2.39)-(6.2.44). Again, we cannot give a single analytic solution of 
these equations during the whole era from just after electron—positron anni- 
hilation until near the present, but fortunately we can find analytic solutions 
in two overlapping eras: first, the era when the energy density of the uni- 
verse is dominated by radiation (photons and neutrinos), and second, the 
era when perturbations are well within the horizon. 


A. The radiation-dominated era 


We can solve Eqs. (6.2.39)—(6.2.44) analytically in the radiation-dominated 
era, when pr > py. We will assume tentatively that at this time the photon 
and neutrino density fluctuations on the right-hand side of the gravitational 
field equation (6.2.39) dominate over the dark matter density fluctuations, 


an assumption we will check later in this section. In this era a œ t!/*, so 
Eqs. (6.2.39)-(6.2.44) take the form 
d 327 Gt /_ 7 
= (ta) = ~~ (Byra $ Pvdvq) (6.4.3) 
yq — (q?/a*)buyg = -Yq » (6.4.4) 
dDqg = —Wq > (6.4.5) 
d (1/2 =a slui 
= (1? suyq) = -58,4 (6.4.6) 
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bug — (G7/07)8tvg = —Wq ; (6.4.7) 


ree ie 
= (i 51,9) = nt bag (6.4.8) 


We are interested in adiabatic solutions for which all dy, and dug, become 
equal at early times, so since the differential equations here are the same for 
photons and neutrinos, these adiabatic solutions have 


Syg = bvg,  Sltyg = SUlyg - 


For these modes, Eq. (6.4.3) now simplifies to 
d 327 Gprt 1 
= (ta) = aaa ae 2. = =7ôva . 
Also, assuming that the cosmological perturbations are in the growing 


adiabatic mode, Mode 1, they satisfy the initial conditions (6.2.27)-(6.2.30): 
For q/aH « 1 


242pDo 242—Do 
qR qR 
yq = öB4 = öy > — y“ 1 8b > 5 q (6.4.9) 
2790 3 .2po0 
tq R 2P R 
Y > nE ,  bUyg = bUyg > ae (6.4.10) 


where Ra is the value of R4 outside the horizon. The reader can check that 
the solution of Eqs. (6.4.3)-(6.4.8) satisfying these initial conditions is! 


Bf 2 % 2 2 
ôyq = 8Bq = vq = IRI oe = IS COS 2 ; 


(6.4.11) 
y= e (Z sino + 5 cos 0 - J; -1) ; (6.4.12) 
so = 6r f (<rsino + e080 - 5-55) odd, (6.4.13) 
Suyg = uvg = 4R? (= ia = 2) l (6.4.14) 


1By replacing ¢ with © as the dependent variable, Eqs. (6.4.3), (6.4.4), and (6.4.6) are put in the form 
of a parameter-free third-order system of differential equations for 5)g = ôgq = 5vq, dUyq/t = Suyq/t 
and tw. After finding the solution that matches the initial conditions (6.4.9), (6.4.10), we can solve 
Eq. (6.4.5) for ôpq by an integration. Aside from normalization, this solution for the various fluctuations 
is equivalent to that given for the Newtonian potential in a different gauge in Eq. (48) of S. Bashinsky 
& E. Bertschinger Phys. Rev. D 65, 123008 (2002). 
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where 
0 = —. (6.4.15) 
3a 
Note that the fractional perturbations 5, are all of the same order of mag- 
nitude for moderate values of ©. This justifies the neglect of the matter 
terms in Eq. (6.2.39) when Or is much greater than pp and q/aH is not 
very much larger than unity. The condition for continuing to neglect the 
matter terms in Eq. (6.2.39) for perturbations deep inside the horizon, when 
© becomes large, will be discussed below. 


B. Deep inside the horizon 


We can also find a solution of Eqs. (6.2.39)-(6.2.44) when the wavelength 
is well within the horizon, in the sense that g/a >> H, whether or not pp 
is negligible compared with pr. For g/a >> H we can distinguish two 
different kinds of solutions: “slow modes,” for which time derivatives yield 
factors of order H, and “fast modes,” for which time derivatives acting 
on the perturbations yield factors of order of g/a, as well as other terms 
with factors of H instead. Eqs. (6.2.39)-(6.2.44) are a sixth-order system 
of differential equations, so they have six independent solutions. We are 
going to identify four independent fast solutions and two independent slow 
solutions, so we can be sure that there are no solutions other than what we 
have called fast and slow modes. 


1. Fast modes 


Up to now we have ignored the neutrino anisotropic inertia, but for the 
rapidly oscillating fast modes we must take into account its effect of the 
long neutrino mean free path in damping the neutrino density and velocity 
perturbations. In considering the fast modes deep inside the horizon, we 
shall simply assume that this damping allows us to ignore 5,4 in Eq. (6.2.39). 
Turning to the other perturbations in a fast mode with fractional rates of 
change of order q/a, Eq. (6.2.41) shows that dp, is of order wy/(q/a), so the 
dark-matter term on the right-hand side of Eq. (6.2.39) is 
of order 
An Ga” ppWq/(q/a) < 3H? a Wq/2(q/a) , 


while the left-hand side of Eq. (6.2.39) is of order (q /a)aWq, and the dark 
matter term is therefore less than the left-hand side of Eq. (6.2.39) by a 
factor less than of order < H*/(q/a)”, which deep in the horizon is much 
less than unity. Dropping the dark matter term in Eq. (6.2.39), we see that 
dyq must be at least of order (q/a)Wq/H 2. Eq. (6.2.43) then shows that 
duyg is of order 5,,/(g/a), and hence at least of order wy /H 2. Both terms 
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on the left-hand side of Eqs. (6.2.42) are then larger than yq by factors 
of order g?/a?H* >> 1, so we can drop Yq on the right-hand side. (That 
is, because the wavelength is short, pressure gradients exert a much larger 
force on the baryon—photon plasma than gravitation.) Hence if we neglect 
all terms in Eqs. (6.2.40), (6.2.43), (6.2.39), and (6.2.41) that are suppressed 
by factors H?/(q/a)* (but not terms arising from derivatives of a that are 
only suppressed by factors H /(q/a)), these equations become 


yq = (Gq? /@ uyg (6.4.16) 
d (A+ R)su 1 
T (u Ht) = -> 8yq> (6.4.17) 
d 162 Ga? _ 
Fhe) ee DARA Dox (6.4.18) 
Dq = —Wq (6.4.19) 


where, as before, R = 30g/4p,. Equations (6.4.16) and (6.4.17) have two 
independent solutions for 6,7 and du,,. Given these solutions, and looking 
only for fast modes, Eq. (6.4.18) then has a unique solution for wy, and 
Eq. (6.4.19) then has a unique solution for dp,. (Possible constant terms 
that might be included in the solutions for aby or dpg would contribute to 
the slow modes, not the fast modes.) Together with the two strongly damped 
solutions of Eqs. (6.2.42) and (6.2.44) for the neutrino perturbations, there 
are four independent fast modes, as promised. 

By eliminating 6u,,, from Eqs. (6.4.16) and (6.4.17), we obtain a second- 
order differential equation for 5,, alone 


d d 
= (aa j RB) 5,4) re Lö =0. (6.4.20) 
If a and R were constant, this would be just the wave equation for a sound 
wave, with physical wave number g/a and velocity? vs = 1/./3( + R). 
With a and R varying at a relatively slow fractional rate H, Eq. (6.4.20) can 
be solved for g/a >> H by the WKB approximation. For this purpose, we 
introduce a new independent variable, the phase g = q i dt/a/3(1+ R), 
and rewrite Eq. (6.4.20) as 


Aoi de 1 din( + R) db,,g 
dọ 2 do do 


2Note that the condition of constant entropy gives dog/PB = dpy / (By + Py) = (3/4)dpy/py, so 
v? = dp/dp = dpy [3(dpy + dpg) = 1/30. + R). 
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Writing ôq = Aexp(+ig), and neglecting d? A/dg? in the first term on the 
left and neglecting d.A/d¢ in the second term (which is already small because 
of the factor d In(1 + R)/d@), we find dA/dg = —(A/4)dIn(1 + R)/dg, so 
the WKB solutions are? 


i dt 
+ _ —1/4 ; 
až, = (1 +R) "4 exp Ea as Ma (6.4.21) 


There is a further complication that must be taken into account for fast 
modes, though it is not important for slow modes. The amplitude of a 
sound wave whose physical wave number k is larger than the inverse mean 
path of the particles in a relativistic medium is damped by viscosity and heat 
conduction, at a rate given in general by* 


=, Je a r i 
~ 2p +p) NOE 


Op 2 dp n (op 
—2T T — > 6.4.22 


where n, x, and ¢ are the coefficients of shear viscosity, heat conduction, and 
bulk viscosity, respectively, defined in Appendix B; vs is the sound speed; 
n is any number density on which the fluid properties may depend; and 
subscripts indicate the quantities held constant in taking partial derivatives. 
For the baryon—photon fluid, for which n is the baryon number density, we 
have p = pB + py, p = py/3, with pgp x n and py « T*. Also, as we 
have seen v? = 1/./3(1 + R). Hence, (setting k = q/a) the damping rate 
becomes 


3q? 


~ 8a2py(1 + R) 


4 x TR? 
6.4.23 
s TELNE PS 
The viscosity and heat conduction coefficients for photons interacting with 
a non-relativistic plasma with mean free time t, = 1/o7ne are 


16 


4 
grv» XT = zPyty. ¢=0 (6.4.24) 


Ga 3 


3P J. E. Peebles and J. T. Yu, Astrophys. J. 162, 815 (1970). R. A. Sunyaev and Ya. B. Zel’dovich, 
Astrophys. & Space Sci. 7, 3 (1970). 
48. Weinberg, Astrophys. J. 168, 175 (1971). 
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so in this case the damping rate is 


Gly 16 R? 
~ 6a2(1+R))15 1+R 


(6.4.25) 


The effect is to replace Eq. (6.4.21) for the fast mode amplitudes with 
+ 1/4 i dt i 
65, = (1+ R) "ex E Í == S rar ; 
4 PIERE aV3(I +R) Jo 
This damping of the fast modes is known as Silk damping. With this result, 
Ug for these fast modes can be obtained from Eq. (6.4.16), WF can be 


obtained from Eq. (6.4.18), and then 55 q can be obtained from Eq. (6.4.19). 


(6.4.26) 


2. Slow modes 


For solutions whose fractional rate of change is of order H = O(1/t), we 
can run through the same sort of counting of powers of q/aH as for fast 
modes, but with very different results. From Eqs. (6.2.41), (6.2.43), and 
(6.2.44), we see that Sp, is of order wy/H while 6,, and 5, are of order 
H8uyq and Hduyq, respectively. The terms p4 and ôvq on the left-hand 
side of Eqs. (6.2.40) and (6.2.42) are then of order H? ôuyq and H? ôuvq, 
and hence are less than the terms (q7/a7)duyq and (q? /a?)ôuyg by factors of 
order H?a?/q?, and may be dropped, giving instead 


(q° /)8uyq = (7/8) uvg = Wq , (6.4.27) 


so Eqs. (6.2.43) and (6.2.44) show that ôq and 4,, are of order (H/P bq. 
The ratios of the photon and neutrino terms on the right-hand side of 
Eq. (6.2.39) to the dark matter term are then of order 


photons & neutrinos _ 6 bp +8pr/3\ (aH 
dark matter 7 Pp q? f 


5This damping rate was first calculated by N. Kaiser, Mon. Not. Roy. Astron. Soc. 202, 1169 
(1983), and is derived here in Appendix H. The formulas for the shear viscosity and heat conduction 
coefficients are obtained by comparing formulas (6.4.23) and (6.4.25) for the acoustic damping rate, 
taking into account that the bulk viscosity vanishes because energy and momentum are transported by 
relativistic particles; see ref. 4. The damping rate calculated by ref.4 had given the correct values for x 
and ¢, but it gave a value for 7 that was 3/4 the correct value of Kaiser, quoted here in Eq. (6.4.24). This 
was because its results were based on calculations of L. H. Thomas, Quart. J. Math. (Oxford) 1, 239 
(1930), that had assumed isotropic scattering and ignored photon polarization. (The same value for n 
had been given earlier by C. Misner, Astrophys. J. 151, 431 (1968).) Kaiser’s results are calculated using 
the correct differential cross section for Thomson scattering and take photon polarization into account, 
and therefore supersede the value for n quoted in ref. 4 and Chapter 15 of G&C. 

6J, Silk, Nature 215, 1155 (1972). 
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where as before pr = Py + Py. The term in the numerator proportional to 
pB contributes much less than unity to this ratio, because pg/pp œ% 1/5 
and we are now assuming that aH/q « 1. According to Eq. (6.4.2), 
the contribution of the term proportional to pr is much less than unity 
throughout the matter-dominated era. It actually begins to be much 
less than unity during the radiation-dominated era, when the ratio of radia- 
tion density to dark matter density falls below the critical value (ØR/ØD)Yerit, 


given by 
(=) = ay (4 ae (6.4.28) 
PD) crit 8 aH pp Í a 


According to Eqs. (6.4.1), the right-hand side is constant and much greater 
than unity throughout the radiation-dominated era for the wavelengths con- 
sidered in this section, so Pr/ Pp will fall below this critical value well before 
radiation—matter equality. (That is, strong pressure forces keep perturb- 
ations to the baryon—photon plasma density small enough so that their 
effect on the gravitational field is negligible once p,/pg falls below the 
critical value (6.4.28), even though the unperturbed radiation density is 
at first still larger than the dark matter density.) From then on the photon 
and neutrino terms may be neglected in Eq. (6.2.39) for the slow modes, 
yielding 


d = 

Shey) = —4n Gppa’sp, . (6.4.29) 
The remaining equations, (6.2.41), (6.2.43) and (6.2.44) are unchanged: 
Dg = — y4 > (6.4.30) 
d (A+ R)ôuyg 1 

= ——ô 6.4.31 
dt ( a 3a 1? ( ) 

d (ôuvq 1 
we ———_§ 6.4.32 
dt ( a ) 3a "1? ( ) 


Using Eq. (6.4.30) in Eq. (6.4.29) yields a second-order differential equ- 
ation for 5p,:! 


d ( dôpnq 2 = 
we = Dg . A. 
P7 (« i ) 4r Ga ppdpq (6.4.33) 


7Eq. (6.4.33) was first derived by P. Mészáros, Astron. Astrophys. 37, 225 (1974), who simply ignored 
fluctuations in the radiation energy density. The argument given here for the neglect of perturbations in 
the radiation density in Eq. (6.4.33) was given by S. Weinberg, Astrophys. J. 581, 810 (2002). It applies 
only to the slow mode part of the solution; in the fast mode it is the perturbations in the dark matter 
density that become negligible for small wavelength. This paper also gives comments on other attempts 
to justify the neglect of perturbations in the radiation density in Eq. (6.4.33). 
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It is convenient once again to convert the independent variable from ¢ to 
y = a/agQ = Pm /Pr, using the Friedmann equation 


2 
ae 82 GPE = z 
4 lea (y 34 y 4) : (6.4.34) 


p = —— (PM + PR) = 3 


with pga the values of m and pr when they are equal. Then (6.4.33) 
becomes what is sometimes known as the Mészáros equation: 


2 
ya + L +(I + z) Aa ža — b) p = (6.4.35) 
2) dy 


where 6 = as = g/m. The independent solutions of Eq. (6.4.35) 
for B = 0 were given by Mészáros,” and by Groth and Peebles® 


3y Jl +y z 
aD =142 i= (145 >) In Se 3a 
2 2 JI+y 
(6.4.36) 


Subsequently Hu and Sugiyama? gave two independent solutions for 
general 6: 


1 1 1l 
ông X (l +y) F (osos + 5,204 +> =) 


2° SA +y 
where F is the Gauss hypergeometric function and 
1+ /1+ 246 
Q+ = = a > 


In order to obtain our final results in an analytic form, we will continue 
to drop corrections proportional to B = pg/py ~ 1/6, while keeping 
those proportional to R = 3g/4/,, so we shall use the slow solutions 
(6.4.36) for B = 0. (Corrections for the finite value of 6 are discussed in 
the following section.) From these two solutions, we can find unique corre- 
sponding slow solutions for wy, duyq, dUyq, 5yq and 5,, by successive use of 
Eqs. (6.4.30), (6.4.27), (6.4.31) and (6.4.32). (The neutrino perturbations are 
of no known observational interest.) We have thus found two slow modes, 
giving six in all. 


C. Matching 


Fortunately, for small wavelength there is an overlap in the two eras in which 
we have found solutions for 5p, etc., satisfying both conditions g/a >> a/a 


ŠE. J. Groth and P. J. E. Peebles, Astron. Astrophys. 41, 143 (1975). 
°W. Hu and N. Sugiyama, Astrophys. J. 471, 542 (1996). 
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and py < pr. In this period the solution (6.4.11)-(6.4.14) found for the 
radiation-dominated era can be decomposed into a fast and a slow mode. 
Since the variable © defined by Eq. (6.4.15) is here much larger than unity, 
we have 


2 2 
Sya = Spo = Spg = SRG G sin © — (1 = 3) COS o) > 


> =3Re cos © (6.4.37) 
: BIRS /2 2 ORE 
yest = a G sin © + z cos o) => rca sin®@ , (6.4.38) 


sfast — ER? f j ( ee ae 2)» do m ©, (6.4.39) 
== —, Sin — COS => cos O, F: 
Dq q ©; 93 94 @2 


ine re) 2tR? 
ocala as Waal )- 4sin@, (6.4.40) 


fast __ fast _ 0 
upg = OUyg = URI ( 


20 02 (S) 
and 
slow = slow = slow — _ ORG (6.4.41) 
vq Bq vq 02’ a 
BRS / 2 SRI 
l — q q 
ws Ow _ _— ; & + i) > eae ; (6.4.42) 


0/2 2 2 1 
ager = ors | — sin’ + —cos? — — — — | ddd 
Dq q 0 33 94 94 92 


1 1 
+6R° -atat 0/0 | 
[h+ db +m(ore) 


1 
> 6R? (-; tytin o) f (6.4.43) 
0 
5ySlow = 5yslow — ARa (6 4 44) 
yq vq @2 Š S 


where again © = 2qt/v3a, ©; is any constant in the range 1 « ©) < O, 
and y = 0.5772... is the Euler constant.!° 


10To evaluate the asymptotic limit of the integral in Eq. (6.4.43), we can rewrite this integral as the 
sum of three terms, each of which converges at 3 = 0: 


PU(A aka, 12 en sa Bd gta to 2\ 4 
[ (Zso ae PT =) >= (Zso aa OS =) DA 
fo (ee 1 ie ef 1 1) do 
Jo v 1+8) 9 Jo 1+0 v` 
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Matching the solutions in the radiation-dominated era and deep within 
the horizon is straightforward for the fast modes. In the radiation-dominated 
era, when R < 1, damping is negligible (because the mean free time is very 
short), and a « „t, the argument of the cosine in Eq. (6.4.37) may be 
expressed as the integral 


[ dt ue 2qt 

"Jo BOTRA a3 

Hence the linear combination of the two fast solutions (6.4.26) that fits 
smoothly with the result (6.4.37) is: 


fast __ 


ZR? t t dt 
paraoa H d ( / | . 6.4.45 
va SUERA" we Nl. a BCE) ae) 


By a successive use of Eqs. (6.4.16), (6.4.18), and (6.4.19) (and ignoring 
the time dependence of all factors except the rapidly oscillating sines and 
cosines), we then also find that for g/a >> H: 


VIR? t t 
dsyiast = a q a fo Vat sin (J a) . (6.4.46) 
vq ~~ qd + R4 9 a/3(14+ R) 


as O. Yom i k 
yi t= 16/3 G5, 2+R)(1+R)"*(a/q)R?e So Tat sjin (J wre) 


: . dt 
sft — 48r Gp, (2+R)(1 +R)? (a/g? R? e7 h T4 (J a) 
Dq TG py(2+R)\1+R)""(a/q) Rye a 0 a ads) 


We will use Eqs. (6.4.45)-(6.4.48) in dealing with baryon acoustic oscilla- 
tions in Section 8.1. 

The reader can easily check that in the overlap era, when the 
universe is radiation dominated, the perturbation is deep inside the hori- 
zon, and damping is negligible, Eqs. (6.4.45) and (6.4.46) give the same 
results for nee and dulast as Eqs. (6.4.37) and (6.4.40). On the other hand, 


Eqs. (6.4.47) and (6.4.48) give results for ue and ope that differ from 


For large ©1, these three integrals approach the values —1/2, 1 — y, and — In @1, respectively, giving a 


total of 1/2 — y — In ©, which when combined with the term on the second line of Eq. (6.4.43) yields 
the quoted result. 
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Eqs. (6.4.38) and (6.4.39) by a factor p,/pr. This is because in deriv- 
ing Eqs. (6.4.38) and (6.4.39) we treated the neutrinos as a perfect fluid 
throughout the radiation-dominated era, although this is valid only out- 
side the horizon when anisotropic inertia is negligible, while in deriving 
Eqs. (6.4.47) and (6.4.48) we assumed that deep inside the horizon neutrino 
density fluctuations are so damped by anisotropic inertia that they can be 
neglected as a source of gravitational field perturbations. This discrepancy 
is a small price to pay for the simplicity gained by these approximations, 
especially since it will turn out that the slow contributions to wy and dp, are 
much larger than the fast contributions. 

Next let us consider the slow modes. Here there is a complication in 
matching solutions in the radiation-dominated era and deep inside the hori- 
zon. In deriving Eqs. (6.4.41)—-(6.4.43) we have assumed that in the radiation 
dominated era we can neglect perturbations in the baryon and dark matter 
densities as a source of the gravitational field perturbations, but in explor- 
ing the solutions deep inside the horizon we found that this assumption is 
violated for the slow modes once the radiation/dark matter density ratio 
drops below the limit (6.4.28), even if this ratio is still much larger than 
unity. Indeed, we can see from Eqs. (6.4.41) and (6.4.43) that for large 

O, Sy / gslow _, —@? In ©, so the assumption under which Eqs. (6.4.41)— 


(6.4.44) were derived breaks down once ©*1n © becomes comparable to 
PR/ Pp, Which is close to when p,/pp falls below the critical value (6.4.28). 
Therefore we have to interpolate between the results (6.4.41)-(6.4.44), which 
are valid early in the radiation-dominated era, when q7t?/a* < pr/pp, and 
the results later in the radiation-dominated era and in the matter-dominated 
era, when the Mészaros equation (6.4.35) applies. 

This is easiest for the dark matter density perturbation, because its time- 
dependence turns out to have the same form deep in the horizon in the 
radiation-dominated era both before and after Or/ Pp falls below the criti- 
cal value (6.4.28). Before this time in the radiation-dominated era pw is 
given by Eq. (6.4.43). After this time, the gravitational field perturbation 
becomes dominated by dark matter, and the dark matter density perturba- 
tion becomes a linear combination of the solutions (6.4.36) of the Mészáros 
equation, which in the radiation-dominated era when y « | become 

Sp) art 8, > —In(y/4) —3. (6.4.49) 
The linear combination of these two solutions that fits smoothly with 
Eq. (6.4.43) is then 


7 AqV/2 1 2 
slow eR? IS -Lpy tmn | A gd Sb) I (6.4.50 
Dq q 2 y V3HEQAEQ Dq l l 
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where Hgo and agg are the expansion rate and Robertson—Walker scale 
factor at matter—radiation equality. (Here we use t = 1/2H = y?/ /2HrQ.) 
The slow part of the gravitational field, radiation velocity potential, and 
radiation density perturbations are given by a successive use of Eqs. (6.4.30), 
(6.4.27), and (6.4.31) as 


yow = —550" (6.4.51) 
Suse’ = —(a?/q)d50" , (6.4.52) 
3a d d 
low __ l 


In particular, in the matter-dominated era we have y > 1, Spe —> 3y/2, and 
85) > 4/15), so Eqs. (6.4.50)-(6.4.53) become 


9R°aAT 7 Ak 

slow > q l-4 +y +ln (5) ; 6.4.54 
5 T zty 73 ( ) 
ys LN ey fa (6.4.55) 

q aEQt 2 4 V3 l = 

6R°aA Tf 7 Ak 
5 slow aaa q pee l (=) 5 6.4.56 
cane el APRAN i i 
6R@A++3R) F 7 Ak 

jslow q PEVE 1 (=) x 6.4.57 
yg. 7? apot? | ERN V3 S l 


We have here again introduced a dimensionless rescaled wave number 


„= 1V2 _ Gan V2R _ 19-3(q/a)[Mpo'] 
~ agpgHeqg = HoQu Qyh? 


(6.4.58) 


in which we have used the relations Hgg = V2(HoV/Qu)(a0/aEQ)?!” 
and ao/dgg = Qy/QR. In terms of the wave number qgo introduced 
in Section 6.2, for which the perturbation just comes into the horizon at 
matter-radiation equality, we have x = /2q/qE0; so the assumption of 
short wavelength made in this section essentially amounts to the condition 
that « > 1. 
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The full solution up to the time of recombination is given by adding the 
contributions of the fast mode and slow mode: 


i ae | ! e )| 
> — ] -—- + + n{— 
ee Geel eoe V3 


+ 48x Gpy (2 + R) +R)” a/g RI 


t 
— fi Tat q dt ) 
xe /0 cos (| BOLA ETA (6.4.59) 


lear a aE) 
> = n | — 
g aEQt 2 K V3 


+ 16/32 Gp, (2 + R) + R)" 4 (a/g R9 


t 
— h Tdt z; q dt ) 
xe J0 sin (J BAET 31 ER) (6.4.60) 


be > Eae] 
yg et A an aa 
ve agota? 2 4 J3. 


aE eT Jolt sin (f = ) 
oa 


tza + R)/4 SOFR) 
6.4.61 
ERGA +3R) T 7 4K a 
wom E ar) 
oO 
a Ra des ( Í ie) , (6.4.62) 


with T and « given by Eqs. (6.4.25) and (6.4.58). 

Each of the different perturbations (6.4.59)-(6.4.62) is dominated by 
either its fast or slow term. First, let us consider dp, and wy. Comparison 
of Eqs. (6.4.48) and (6.4.47) with Eqs. (6.4.54) and (6.4.55) shows that the 
ratios of the fast and slow contributions to dp, and wy, are of order 


fe z ; 5 
apt _ a2 H2 DR 2 ee 7 aH DR 2 (6 j 63) 
slew q? ÖM ? slow q ÕM me 
q 


We are assuming in this section that the wavelengths are short enough so 
that perturbations enter the horizon during the radiation-dominated era, 
so these ratios are much less than one at matter—radiation equality. Subse- 
quently the ratio of fast to slow contributions to 5p, decreases as 1/ a>, while 
for Wq the ratio of fast to slow contributions decreases as 1 /a>/*. Hence it is 
a good approximation to take dpy = bow and Yq = pew throughout the 
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matter-dominated era. Nevertheless, because the fast terms in Eqs. (6.4.59) 
and (6.4.60) have an oscillatory dependence on q, we will need to take them 
into account when we consider baryon acoustic oscillations in Section 8.1. 

We also need ôq and du,, in calculations of the cosmic microwave 
background anisotropies. Comparison of Eqs. (6.4.45) and (6.4.46) with 
Eqs. (6.4.57) and (6.4.56) shows that the ratios of the fast and slow contri- 
butions to 5,4 and uy, are (apart from the damping of the fast terms) of order 


fe = fe = 
bq OBR buyg q ÖR N 
slow Ge py’ ôuslow aH pm ` _ 


These ratios are much larger than unity at horizon entry, and remain so until 
matter—radiation equality. After that the ratio of fast to slow contributions 
to 5), remains of the same order of magnitude, while for du, the ratio of 
fast to slow terms decreases like 1/,/a. Hence, once the wavelength enters 
the horizon, the slow contribution to 6,, is nominally smaller than the fast 
term, while the slow contribution to du, remains nominally smaller than 
the fast term until late in the matter-dominated era. Nevertheless, we will 
keep the slow as well as fast terms here, because they are not affected by the 
damping that suppresses the fast terms. Also, even though it is relatively 
small, the slow term in dzg will be found in Section 7.2 to produce a char- 
acteristic effect in the plot of the cosmic microwave background multipole 
coefficients Cz vs. £ that would not be present with the fast term alone. 


6.5 Scalar perturbations — interpolation & transfer functions 


In the previous two sections we found analytic results for wavelengths that 
are long enough to enter the horizon well after matter—radiation equality, 
or short enough to enter the horizon well before matter—radiation equality. 
Unfortunately, this leaves out wavelengths that enter the horizon around 
the time of matter—radiation equality. It is wavelengths of this magnitude 
that make the dominant contributions to the first acoustic peak at around 
£ = 200 in the multipole coefficients of the cosmic microwave background 
anisotropies. In this section we will consider how to construct formulas 
for the fluctuations that interpolate between the results of Sections 6.3 and 
6.4, concentrating on results in the matter-dominated era, which are of the 
greatest observational interest. 

We can get a good clue to this interpolation by first considering the limit 
of negligible baryon mass density and negligible damping, for which the 
form of the solution in the matter-dominated era can be found exactly, with 
no limitations on wavelength. In this limit, and leaving aside the neutrinos, 
the equations for the perturbations in the matter-dominated era are given for 
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all wavelengths by Eqs. (6.3.17), (6.3.18), (6.3.20) and (6.3.21), with R = 0: 


d oe 
Gt va) = —3t 8a - (6.5.1) 
byq — (@/a°)buyq = =W (6.5.2) 
dpg = —Vq- (6.5.3) 
d —2/3 _ | —2/3 
di (i Suyq) me Ba (6.5.4) 


with a œ 17/3. These have four independent exact solutions, three of which 
are quite simple: 


Solution 1: 


E 39202 E qt 
Dm gya? 1T R’ 
ôpq =l, ôuyq = —t 
Solution 2: 
ôDq = Pq =0, 


ô = -cos ( [<) ôu = -sin ( [<) 
vq = Pica aay = MES age Ni ae 


Solution 3: 
ôDq = Pq =0, 


ô =sin( f =) Uyg = i cos ( k =) 
va Tera EE Ny Waa 


The fourth solution is more complicated; it decays, with dp, « 1/t and 
Wg x 1/ t?. As we will see, the initial conditions are satisfied by a linear 
combination of the first three solutions, so the fourth solution will not con- 
cern us here. 


Without any loss of generality, we can write the linear combination of 
solutions 1, 2, and 3 that fits the solution at earlier time in a form that 
simplifies this fit, as! 


IP PRET (k) 
3q7tR°T (K) 
i [=o (6.5.6) 


1From now on, we define the zero of time so that a « t!/2 in the radiation-dominated era. This 
is different from the definition of the zero of time in Solutions | and 2, where time is defined so that 
a œ t°’? in the matter-dominated era. However, the difference this makes in the integral fh dt/a is just 
a constant, which can be absorbed into A(x). 
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oO 


3R t 
yg = Syg = —* | T(K) —S(k) cos | q eae +Atk))}, 6.5.7) 
5a A 5 0 V3a 


ata Bo ' dt 
bing = Bing = -TO + Sw safa | + a00)}, 


(6.5.8) 


where S(x), T(x), and A(k) are time-independent dimensionless func- 
tions of the dimensionless rescaled wave number introduced in Sections 6.2 
and 6.4: 


q2 _ (g/ao)V2R _ 19.3(g/a0)(Mpe~'] 


= = , 6.5.9 
‘ dEQHEQ Holm uh? ( ) 


in which ago and Hgo are respectively the Robertson—Walker scale fac- 
tor and expansion rate at matter-radiation equality. These are known as 
transfer functions. (These functions can only depend on x because they 
must be independent of the normalization of the spatial coordinates and are 
dimensionless.) The division we made in the previous two sections between 
long and short wavelengths can be expressed in terms of the parameter x. 
In the matter-dominated era we have t = 2/3H = (t/3HEQ)(a /aEQ)*! 2 and 
PR/PM = aEQ/a, SO 
Pq? pr 4? 


Hence £?q?/a? is much less or much greater than pyy/pp, according as the 
parameter « is much less or much greater than one. 

We choose to write the linear combination of solutions 1, 2, and 3 in 
the form (6.5.5)-(6.5.8) because it leads to simple values for the coefficients 
S(k), T (k), and A(x) fork « 1. For pg = 0, the results (6.3.23), (6.3.24), 
(6.3.35), (6.3.36) obtained earlier for moderately long wavelengths satisfying 
t?q?/a? < pm /pr, or in other words in the limit « < 1, are consistent with 
Eqs. (6.5.5)-(6.5.8), and tell us that in this limit? 


Ti 1, S@Mely Alk) > 2835 (6.5.10) 


(The case of extremely long wavelengths does not arise here, because it 

requires that £?q?/a? < R, and for the present we are taking R = 0.) 
Similarly, for og = 0 and F = 0, the results (6.4.59)-(6.4.62) obtained 

earlier for short wavelengths satisfying t7q7/a* >> Py/pp, or in other words 


2The limit 2«//3 for A(k) is the difference for y = m/ör > | between the phase /3qt/a = 
2« /¥/V/3 in Eq. (6.3.34) (for R < 1), and the phase q fy dt/V3a = 2«L,/1 + y — 11/3 in Eqs. (6.5.7) 
and (6.5.8). 
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in the limit x >> 1, are consistent with Eqs. (6.5.5)-(6.5.8), and tell us that 
in this limit? 


45 [7 4 
Tee) > 5 [-5 +7 +in( =), Sik) > 5, Aw) 0. 
(6.5.11) 


As we shall see in Chapter 8, the decrease of 7 (x) for large « is very impor- 
tant in determining the intensity of fluctuations in the dark matter density 
at various wavelengths. (We can understand this decrease qualitatively by 
following the history of the dark matter density fluctuations. According to 
Eq. (6.2.27), for all wavelengths 5p, grows like ¢ at early times, when the uni- 
verse is radiation dominated and the wavelength is outside the horizon. For 
k <1 this growth continues until the universe becomes matter dominated, 
after which Eqs. (6.3.13) and (6.3.23) tell us that 8p, grows like 1/3 both 
before and after the wavelength enters the horizon. In contrast, for « >> 1 
the growth of Sp, like ¢ continues only until the wavelength enters the hori- 
zon, which in this case is during the radiation-dominated era, after which 
according to Eq. (6.4.50) dp, grows only logarithmically with time until the 
universe becomes matter dominated, after which according to Eq. (6.4.54) 
it grows as 77/°, just as in the case x < 1. The end of the period of growth 
proportional to ¢ is at the time when g/a ~ 1/t, which since a œ ,/t is 
at t x 1/q?, so the growth until matter—radiation equality is proportional 
to Ing/q’, accounting for the asymptotic behavior of T(x). That is, the 
smallness of T(x) for x >> 1 reflects not a decay of 5p,, but rather the 
failure of 5pg to grow appreciably during the interval from horizon-entry to 
matter—radiation equality.) 

For values of « of order unity, we have to find some way of interpolating 
between these two limiting cases. This can be done almost “by hand”; it 
turns out that almost any smooth interpolation between the limits k « 1 
and x > | gives reasonable results for the cosmic microwave background 
anisotropies. For better precision, we need to solve the full coupled equa- 
tions (6.2.9)—(6.2.15) (still with og = 0) numerically for general wavelengths 
and general values of y = py/Or, imposing the initial conditions found in 
Section 6.1 for y « 1, and then comparing this solution for y > 1 with 
Eqs. (6.5.5)-(6.5.8). These equations can be put in a dimensionless form by 
using y as the independent variable, and writing 


Spq =K R AOE Syg = 8yqg =k RTOS; 


Wq = (K°HEQ/WV DRIF), Sttyg = ĉuivg = (k?V2/4 HEQ)RIg(y) . 


3This result for T (<) was given by S. Weinberg, Astrophys. J. 581, 810 (2002) [astro-ph/0207375]. 
The fact that the Mészáros equation (6.4.35) implies a fall-off of the transfer function for large wave 
number k like Ink Ik” had been pointed out by W. Hu and N. Sugiyama, Astrophys. J. 444, 489 (1995). 
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Eqs. (6.2.9)-(6.2.15) then become 


d/o-.\_ 3dO) 4rQ) 
VIFIO f0))=-— - y 


d 
vit yd) = IO); 
SAT ro- 20) = yf 0) 
Iy l = ESR Q), 


1 
a (22) = Ley, 


dy\ y 
In this notation, the initial conditions (6.2.27)-(6.2.29) read 


dy) >r) >y, 
4 


E He, 0) > -7 . 


When evaluated for y >> 1, the numerical solutions of these equations* 


match Eqs. (6.5.5)-(6.5.8), with transfer functions given in Table 6.1. Inspec- 
tion of this table shows that these numerical results agree with the analytic 
results (6.5.10) and (6.5.11) in the limits x « 1 and x > 1, respectively, 
although «x must be quite large before the asymptotic results (6.5.11) are 
reached. 

As we will see in the following two chapters, the microwave background 
anisotropies and the correlation function of large scale structure are given 
by integrals involving one or more of these transfer functions, so it will be 
useful to give approximate analytic formulas for these functions. Dicuss 
has found “fitting formulas,” which to a good approximation agree with the 
asymptotic formulas (6.5.10) and (6.5.11) fork « 1 and« > 1, and (except 
for A(x) at large x) generally match the numerical results of Table 6.1 at 
intermediate values of « to better than 2%: 


1/2 

eb In[1 + (0.124 K)? | 1+ (1.257 K)? + (0.4452 K)f + (0.2197 K)f 
K)~ > 

(0.124 «)2 1 + (1.606 x)? + (0.8568 «)4 + (0.3927 K)® 
(6.5.12) 
2 

TER 1 + (1.209 «)? + (0.5116«)4 + 5! (0.1657 x) (6.5.13) 

= | 1 + (0.9459 K)? + (0.4249 x)4 + (0.1657 K)6 : i 


4 thank D. Dicus for this numerical calculation. 
5D. Dicus, private communication. 
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Table 6.1: The Scalar Transfer Functions 


K S T A K S T A 

0.1 | 1.0167 0.9948 0.1207 10 | 3.9895 0.1608 0.3270 
0.2 | 1.0551 0.9780 0.2240 11 | 4.0546 0.1440 0.3147 
0.3 | 1.1147 0.9569 0.3156 12 | 4.1172 0.1298 0.2962 
0.4 | 1.1891 0.9339 0.3852 13 | 4.1841 0.1178 0.2850 
0.5 | 1.2680 0.9101 0.4423 14 | 4.2175 0.1075 0.2747 
0.6 | 1.3529 0.8860 0.4800 15 | 4.2676 0.0985 0.2604 
0.7 | 1.4388 0.8620 0.5148 16 | 4.3135 0.0907 0.2541 
0.8 | 1.5195 0.8384 0.5336 17 | 4.3336 0.0838 0.2438 
0.9 | 1.6081 0.8154 0.5531 18 | 4.3796 0.0777 0.2339 
1. | 1.6801 0.7930 0.5637 19 | 4.4043 0.0723 0.2296 
1.2 | 1.8330 0.7502 0.5784 20 | 4.4233 0.0675 0.2195 
1.4 | 1.9777 0.7104 0.5854 25 | 4.5271 0.0496 0.1920 
1.6 | 2.1126 0.6734 0.5842 30 | 4.6051 0.0383 0.1713 
1.8 | 2.2354 0.6391 0.5782 35 | 4.6650 0.0305 0.1542 
2. | 2.3451 0.6074 0.5700 40 | 4.7087 0.0249 0.1396 
2.5 | 2.5895 0.5378 0.5537 45 | 4.7389 0.0209 0.1276 
3. | 2.7839 0.4798 0.5334 50 | 4.7605 0.0177 0.1182 
3.5 | 2.9473 0.4311 0.5094 55 | 4.7794 0.0153 0.1111 
4. | 3.0970 0.3898 0.4854 60 | 4.7992 0.0134 0.1053 
4.5 | 3.2346 0.3545 0.4659 65 | 4.8192 0.0118 0.0997 
5. | 3.3506 0.3241 0.4509 70 | 4.8365 0.0105 0.0940 
5.5 | 3.4114 0.2976 0.4367 75 | 4.8487 0.0094 0.0885 
6. | 3.5181 0.2726 0.4203 80 | 4.8563 0.0084 0.0838 
6.5 | 3.5953 0.2531 0.4029 85 | 4.8622 0.0077 0.0803 
7. | 3.6754 0.2361 0.3884 90 | 4.8695 0.0070 0.0776 
7.5 | 3.7473 0.2200 0.3782 95 | 4.8792 0.0064 0.0751 
8. | 3.8015 0.2056 0.3695 100 | 4.8895 0.0059 0.0722 
8.5 | 3.8432 0.1927 0.3590 

9. | 3.8865 0.1810 0.3465 

9.5 | 3.9380 0.1704 0.3350 


1/4 
(0.1585 x)? + (0.9702 «)* + (0.2460 K)f h 


1 + (1.180«)2 + (1.540 «)4 + (0.9230 K)f + (0.4197 «)8 
(6.5.14) 


Alk) S l 


These transfer functions are shown in Figure 6.1. A fitting formula for 
T (k) that includes the effects of neutrino anisotropic inertia and fits the 
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Transfer function 


4} 


Figure 6.1: The transfer functions T (x) (solid curve), S(«) (long dashes), and A(x) (short 
dashes), as functions of the rescaled wave number x. 


CAMB numerical results is given by Eisenstein and Hu. Because of the 
neglect of neutrino anisotropic inertia, the results for T(x) of Eq. (6.5.12) 
and Table 6.1 are about 4% too low. 

Now we have to consider how to take damping and the non-zero ratio of 
baryon to photon density into account. Eqs. (6.3.23), (6.3.24) and (6.4.59), 
(6.4.60) show that in the limits of either short or long wavelength, the leading 
terms in dp, and wy are unaffected by either damping or baryons, so in 
leading order we can simply use Eqs. (6.5.5) and (6.5.6): 


IPRIT (K) 
pg = ———— oa 
2g 10a? iar) 
3¢7tR°T (K) 
g= ea ae (6.5.16) 


Damping affects the short-wavelength results (6.4.61) and (6.4.62) for duy,q 
and 6,, by multiplying the sines and cosines with a factor exp(— h Idt). 
This factor is absent in the long-wavelength results (6.3.36) and (6.3.35) 
for these perturbations, but this factor is essentially unity anyway for long 
wavelengths, because I œ g*. Hence for all wavelengths we can take damp- 
ing into account by simply multiplying the sines and cosines in Eqs. (6.5.7) 
and (6.5.8) by exp(— f Tdt). Finally, in both the short-wavelength results 
(6.4.62) and (6.4.61) and the long-wavelength results (6.3.35) and (6.3.36), 


6D. J. Eisenstein and W. Hu, Astrophys. J. 496, 605 (1998). 
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the effect of the non-zero baryon density is to multiply the non-oscillatory 
term in 6,, with 1 + 3R, multiply the cosine in 6,, with (1 + R)~!/4, and 
multiply the sine in 5u,g with (1 + R)~>/4, so it is highly plausible that a 
finite ratio of baryon density to photon density can be taken into account at 
all wavelengths by making the same alterations in Eqs. (6.5.7) and (6.5.8): 


oO 


3R? 
ôyq = Bq = ears [7 Ad + 3R) 


t 
7 -1/4 „— [itd qadt ) 
(1+ R) e /0°“S(k) cos (f a JR TR + A(k) )], 
(6.5.17) 


oO 


3R? 
ôUyq = ÔBq = ER [-tT (K) 


g - fi rat (f qdt )] 
Teepe ade 
(6.5.18) 


To repeat, these results agree with the results of Section 6.4 for short wave- 
length, they agree with the results of Section 6.3 for long wavelength (for 
which F is negligible), and they agree with the results found in this section 
for all wavelengths when damping and the baryon density are neglected. 

We must say a little more about the function 7 («), which is commonly 
known as the transfer function. It is conventional to write this transfer 
function as a function of a variable Q: 


(6.5.19) 


which according to Eq. (6.5.9) is the same as «/19.3. In these terms, 
Eqs. (6.5.10) and (6.5.11) for 7T (k) now read 


O-0 


oa (6.5.20) 


1 
T(k) > 4 1n(2.400) 
(4.07 Q)? 


A numerical solution of the equations for growth of dark matter density 
fluctuations has been fit for large Q with the formula’ 


In(1 + 2.340) 
2.340 


—1/4 
x [1+3.890+(16.10)?+(5.460)°+(6.710)*| . (6.5.21) 


TgBgBKs(k) ~ 


1J. M. Bardeen, J. R. Bond, N. Kaiser, & A. S. Szalay, Astrophys. J. 304, 15 (1986). 
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This goes to In(2.34Q)/(3.96 Q)* for large Q, in excellent agreement with 
the analytic result (6.5.20). However, it should be noted that although the 
BBKS transfer function (6.5.21) is correctly normalized at Q = 0, it does 
not give a good account of the behavior of the transfer function for Q < 1. 
The power series expansion of Eq. (6.5.21) contains odd as well as even 
terms in Q, which is not consistent with the requirement that it must be an 
analytic function of the three-vector q. The Dicus fitting formula (6.5.12) 
was constructed to be analytic in the three-vector q at q = 0, and in fact fits 
the numerical results of Table 6.1 better than the BBKS transfer function 
(6.5.21). In our analysis of cosmic microwave background anisotropies, we 
will use the fitting formula (6.5.12) instead of Eq. (6.5.21). 

All this has been for 6 = pg/py < 1, though we have now taken 
into account a non-negligible ratio R = 36g/4,. The corrections to the 
transfer function for finite values of 6 are roughly 10%; they have been 
calculated analytically® for « >> 1 to lowest order in 6, and numerically? 
for general «x and for selected values of 6 to all orders in £. In using the 
transfer function in calculations of large scale structure, it is common to 
use a simple modification of the fitting formula. Peacock and Dodds!” 
proposed in effect that the same fitting formula could be used for finite 
baryon density, but with « = (k./Qr/HpQy) exp(Qzg). This worked well 
for the limited range of cosmological parameters studied, but it is physically 
impossible for the transfer function to have this sort of dependence on Qg. 
There is no way that the physical processes during the radiation-dominated 
era that are responsible for the transfer function to know anything about 
the time at which we happen to measure cosmological parameters like py. 
Aside from «x, the transfer function can only depend on quantities such 
as B = g/m = Qp/Qy, which is constant, or Qyh?, which (for a 
known present radiation temperature) tells us the matter density at any 
given temperature. Indeed, Sugiyama!! pointed out that the correction 
factor exp(Qg) actually works well only for values of Qw close to unity. 
As an alternative that would apply for smaller values of Qm, he proposed 
a correction factor exp(Qg + Qg/Qym), which is physically impossible for 
the same reason as exp(Q2g). Another difficulty with all these suggestions is 
that the baryon correction must disappear for small wave number, because 
in this case pressure forces are negligible, and baryons behave just like cold 
dark matter. A baryon correction that satisfies all these physical criteria has 
been proposed by Eisenstein and Hu: The transfer function is evaluated 


8s. Weinberg, ref. 2. 

9J, A. Holtzman, Astrophys. J. Suppl. 71, 1 (1989). 

10J, A, Peacock and S.J. Dodds, Mon. Not. Roy. Astron. Soc. 267, 1020 (1994). 
IIN, Sugiyama, Astrophys. J. 100, 281 (1995). 
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(KARAR l-a J! 
oe (3) E FE ed ae 


a = 1 — 0.328 In(431 Qmh?) B + 0.38 In(22.3 lmh?) B? , 


with « taken as 


where 


and s is the acoustic horizon at the time t = tz of last scattering, projected 
(as is k) to the present: 


sean) f vs dt 
= No OO) 


with dy given by Eq. (2.6.32). 


=(1+zz)dy , 


6.6 Tensor perturbations 


We next turn to the tensor modes. These are considerably simpler to study 
than the scalar modes, so what took five sections to analyze for the scalar 
modes will be treated here in just one section. As in the case of scalar 
modes, we begin by setting down the full set of equations used in computer 
programs like CMBfast, and then move on to approximations. 


A. Cold dark matter and baryonic plasma 


As already mentioned, the particles of both the cold dark matter and bary- 
onic plasma move too slowly to contribute any anisotropic inertia. In ten- 
sor modes there are no perturbations to densities or streaming velocities, so 
there are no perturbations to either the cold dark matter or baryonic plasma 
that need to be followed here. 


B. Gravitation 


According to Section 5.1, in tensor modes the gravitational perturbation 
takes the form: 
Sgi(x, 1) = a’ (t) DjX, t), (6.6.1) 


with Dj(x, t) satisfying the wave equation (5.1.53): 
Dj +3HDj — a? V" Dy = l6rGri , (6.6.2) 
and the trace and transversality conditions 


Di=0, aD =0. (6.6.3) 
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6.6 Tensor perturbations 


We will see that the anisotropic inertia tensor Ty 


Di, so Eq. (6.6.2) has two independent solutions. During the period when 
the perturbation is outside the horizon, the anisotropic inertia tensor mi 
and the av" Di term are negligible, and Eq. (6.6.2) becomes 


is a linear functional of 


Dj + 3H Di =0. 


Hence outside the horizon one of the solutions is constant, while the other 
decays as f a~?dt, which in the radiation dominated era goes as t~!/?. For 
all interesting wave numbers the perturbation remains outside the horizon 
during many e-foldings of cosmic expansion, so the decaying mode becomes 
negligible, and we can consider only the other mode. The metric perturba- 
tion and anisotropic inertia can therefore be put in the form 


Dja, t) = D) d>q eit Bq, X) ey, X Dal) , (6.6.4) 
A= 


ians D f PaA a. veGr77O. 665 
A=+2 
where 6(q, à) is a stochastic parameter for the single non-decaying mode 
with wave number q and helicity A; eij (ĝ, A) is the corresponding polarization 
tensor, defined in Section 5.2, with ej = qe = 0; and Dg4(t) is the solution 
of the wave equation 


Dy t+3HDy +a °g Dy = 16r Gri . (6.6.6) 


We will return later to the solution of this equation. 


C. Photons 


The Boltzmann equation for the photon density matrix perturbation 
ên” (x, t) in the tensor mode is given by Eq. (H.35) as 


adn (x,p.t) Pk d5n'I(x,p,t) 24O) 
at a(t) axk a(t) 


aah, P) Ppi Dex, t) (8y — PiP) 


bn} (x, p, 0) 


S 
= —øe(f) ôn} (x, p, 1) + et J apy 


x [an] (x, pôi, D — pide Sn (x, ppi, D — Bide n$ (x, p1, D) 


+ Pidjbxbr Buk! (x, pôi, D| , (6.6.7) 
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in which we have used Eq. (6.6.1) for the metric perturbation. As in the case 
of scalar modes, instead of sn!) (x, p, t), we will concentrate on the fractional 
intensity matrix defined by the analog of Eq. (6.1.13): 


Cc 
HO by OIyp. =a) f apd dp. 668) 
We seek a solution in the form 


Igx,b,0) = Ef E E A (6.6.9) 
A=t2 


The Boltzmann equation ome then takes the form 
ð Ji (q, Ê, t, à) 


4 LAEE 


ot a 
+ PxP1 ek Â, 4) Dg (0) (6 E ibi) 
‘ 3a@¢(t) 


x Rac t, à) — Pie Tig lA, t, à) — Pjk Tik Qs t, à) 
+ PiPjPKP1 Iki (q, t, i) | ; (6.6.10) 


where 
2, A 


ap 
Fylat r= fH Jij(q. pst, d) . (6.6.11) 


Furthermore, because Jj(q, p, t, à) for a given helicity à must be a linear 
combination of the polarization tensor components e;;(q, à) with the same 
A, while gx exj(g, à) and ekk(ĝ, à) both vanish, the only possible form of 
Jy (q, t, à) allowed by rotational invariance is just e;;(ĝ, 4) times some func- 
tion of q = |q| and ¢. This relation is conventionally written 


2p Sos 


Rotational invariance allows the intensity matrix perturbation to be written 
in the form 


X 1 EAN ot A 
Jj(q,ĝ, t, à) = 5 (èi - pip) PrP1eki(G,A) 
PAS T A A 
x (aP POFA G D) 
+(€1/(4. 2) — Pirey QN — PPren Â, + PibjPrB1 eA») 


x AP (GP 4.0 (6.6.13) 
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6.6 Tensor perturbations 


(Here the superscript T stands for “tensor,” while the subscript T stands for 
“temperature.” The coefficients are chosen so that Jj; is proportional to Ay, 
and the polarization is proportional to Ap.) A third term proportional to 
(Gi—Pi(P-Q)) (Gj —Pj P- @)PKP1eni Would be allowed by symmetry principles, 
but is not generated in the Boltzmann equation by Thomson scattering. 
Using Eq. (6.6.13) in Eq. (6.6.10) yields separate Boltzmann equations for 
(T) (T) 
Ay and Ap 


0 =: 

aor 4: wd) +ia Oqu AP q, u, t) 

= -2D A) - oe) AP qu, t) + olt) Yq, (6.6.14) 
AP, u, t) tia Oqu AP, u, t) 


= —ve(t) AP (q, p, Ð — olt) Vq, t) . (6.6.15) 


T 


The functions Aw and AG: may be expanded in Legendre polynomials 
[0,0] 
APAP 4.0 = Y UD PA DARAD (6.6.16) 


£=0 


CO 
AD (Gb: 4.0 = Yi 2+ 1 PeG-PALLG.D. (6.6.17) 
£=0 


Using the familiar recursion relation 


£+1 £ 
z Pelz) = z~~ Pe (z) +> Pela), 


22+ 1 2+ 1 
we find that the Boltzmann reais (6.6.14) and (6.6.15) now read 
M+ — 2 (e+ pall, ta) 
= e + oW) 84,0 — OAT) (6.6.18) 
Ape + ao + DAP ea T tay? 4) 
= oY beo — We App, - (6.6.19) 


To calculate the source term W(gq,f) in terms of partial waves, one first 
integrates Eq. (6.6.13) over p, using the formulas 


LNs, idee fe x bo se Titec ae ee 
[ Poro Direna = sei feroa — @-0?) 

De P E E E a Df DR pe A ROVE 
| Por DDB PBEM = ji f erod (1 — @-q) ) : 
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where f is any function of p - g and ej is any symmetric traceless matrix 
function of g with g;e; = 0 and ej; = 0. The integral gives 


3 f LẸ 1 k oN a 
va.n=-3 [ l-50 -00) APAD 


l 2 A 
(60+ -6 S) Papa]. 
(6.6.20) 


Inserting the partial wave expansions (6.6.16) and (6.6.17) then gives:! 


Y(q, t) = aang) + apan Fag.) ANa 


FAD, t) — SaDa t). (6.6.21) 

Eqs. (6.6.18) and (6.6.19) thus form a closed system of coupled 
differential equations for the partial wave amplitudes produced by a given 
gravitational field perturbation D¿(t). Of course, their solution requires a 
truncation of the partial wave expansion at some maximum £. 

The solution is of interest in itself, because we can measure anisotropies in 
the cosmic microwave background. It will be applied to these anisotropies 
in Chapter 7. It is also needed in calculations of the tensor anisotropic 
inertia. In tensor modes, the only non-vanishing contribution of photons 
to the energy-momentum tensor is to the space-space component (6.1.10) 


STi) = a4) J d?’p d? (t)dn* (x, p, t) pPibj 
-505 f dasane f Binapani 
7 
= 0D J Paban eua 
«ft Aan (q, P- ĝ, ODD PBI 
= y(t) Efe B(g, A) e0, 
7 


DEON E a ON? 
S aPar-ao(l-@ 9). 662 


IR, Crittenden, J. R. Bond, R. L. Davis, G. Efstathiou, and P. J. Steinhardt, Phys. Rev. Lett. 71, 324 
(1993) [astro-ph/9303014]. 
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For tensor modes there is no pressure perturbation, so according to 
Eq. (5.1.43), this is the same as the anisotropic inertia tensor my jæ D. 
Comparing Eq. (6.6.22) with Eq. (6.6.5) thus gives 


=i (t) = 20 22 
Tyq 


-AT GP “a a 


=25 0|; e aot Aane aan] 
(6.6.23) 
Experience shows that to accurately calculate the partial wave amplitudes 
upto £ = 4, which appear in Eqs. (6.6.21) and (6.6.23), one needs to solve the 
Boltzmann equations for the partial wave amplitudes up to larger values of £, 
up to £ = 10. Once W is calculated in this way, we can calculate J; (q, p, t, à) 


for very much higher values of £ by using the “line of sight” solution of 
Eq. (6.6.10): 


t t dt” t 
Ji (q, pt, A) = J dt exp (-ia of — -f dt” onl) 
t v alt") 1 


x | — ÎrPı (èy - pip) eÂ, X) Dat) — welt) Yq, t’) 


(e0 X — Piôker QX — PPren, 2) + PibjPrPreK @. »)| 
(6.6.24) 
where ¢; is any time that is early enough before recombination so that 
wc(t1) > H(t), which allows us to drop a term proportional to 


Ji(q,P,t1,4). In terms of the temperature and polarization amplitudes 
defined by the decomposition (6.6.13), the line-of-sight integrals read 


t ; f t dt” 
AS (q, hH, t) T ENR? H, t) — 2 | dt exp [iow f aN, 
ti v alt”) 
t 
E J welt") ar x Dylt’) , (6.6.25) 
p 
(T) F t dt" t J 
A „H4, t) = — t exp|—i — — | a(t") dt” 
p (4b,t) f P| au f Ca Í (£) | 
x welt’) Yq, t). (6.6.26) 


This line of sight integral also provides an alternative to the use of the 
truncated partial wave expansion.? We can derive an integral equation for 


25, Weinberg, Phys. Rev. D 74, 063517 (2006) [astro-ph/0607076]; D. Baskaran, L. P. Grishchuk, 
and A. G. Polnarev, Phys. Rev. D 74, 083008 (2006) [gr-qc/0605100]. 
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W(q, t) by simply analytically integrating Eq. (6.6.24) over p. Equating the 
coefficients of e; on both sides gives the integral equation 


t t 
Yq, t) = al dt' exp -f w(t") a 
ty tH 


ji ; t dt" ; t dt" , 
[rolf orh, E) ve] 


(6.6.27) 


where K (v) and F (v) are the functions 
Kw) =fa(v)/v?, F) = jo(v) — ji w)/v + 2j(v)/v* (6.6.28) 


Eq. (6.6.27) can be solved efficiently either by iteration, or by numeri- 
cal recipes appropriate for integral equations of the Volterra type. Once 
Y (q, t) is calculated in this way, the complete photon intensity matrix can be 
obtained by a numerical integration in Eq. (6.6.24). 


Neutrinos 


The Boltzmann equation for the perturbation ên, (x, p, t) to the neutrino 
phase space density is given by Eq. (H.14) as 


ddn, (X, p. t) Di ddny (X, p. t) pn (p) A A 
et ĝi Dilx,t) 6.6.29 
at a(t) ax! 2 Pibi Dy(% 9) ee!) 


in which we have used Eq. (6.6.1) for the metric perturbation. As in the 
case of scalar modes, instead of 5n,, we find it more convenient to deal 
with a dimensionless intensity perturbation J, defined by the analog of 
Eq. (6.1.42): 


a(t) (NI (x,p,t) = N, Í ET dp , (6.6.30) 
0 


where N, is the number of species of neutrino, counting antineutrinos sep- 
arately, and py = N,a~* if Arp*iny(p). This satisfies a Boltzmann equa- 
tion 
ad (x, P, t) Îi OJ (x, P, t) AA fF 
— = —2pip; Dij(x, t), 6.6.31 

at a(t) ax’ Pip; DjX, t) ( ) 
with Dj given by Eq. (6.6.4). We will be able to find a solution in the 
form 


EODD J d°g e'™*B(q, A) ey 0, DAPA a, P-D. (6.6.32) 
A=+2 
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Then Eq. (6.6.29) becomes an orc for AG: 
Aq, u, t) PREL T 5 BEAD) (q, u, t) = —2D,(t) (6.6.33) 
This can be solved by a partial wave expansion 
AM Gu.) =X Q€+ DAVE, (6.6.34) 


L 


with An (q, t) satisfying 


AT ) (T) (T) = : 
Aw + a Tt + DAG, LAG) = -2b 0. (6.6.35) 
This of course needs to be truncated at some more-or-less arbitrary maxi- 
mum value of £. But instead we can find a direct solution of Eq. (6.6.33), 
as a line of sight integral 


t t dt" : 
AP q, u, t) = —2 J dt’ exp (~ian J a) Dıt’). (6.6.36) 
ti t a(t ) 


We take ¢; soon enough after the decoupling of neutrinos at T ~ 10!°K 
so that at this time the distribution of neutrinos is still essentially that of 
local thermal equilibrium. In this case the perturbation to this distribution 
arises only from the perturbations ôT, and du, to the neutrino temperature 
and streaming velocity, which do not have tensor components, so we do not 
need to include an initial value term AWD (q, u, t1) on the right-hand side of 
Eq. (6.6.36). 

It will be a long while before anyone measures the angular distribution 
of cosmic neutrinos, so the only use to be made of calculations of ôn, is 
in calculating components of the energy momentum tensor. In the tensor 
mode, the only non-vanishing componentis ôT! ;, given by the same formula 


(6.6.22) as for photons, except that A ae instead of Ark 


STİ; (x, t) = a *(t) J d’p ôn, (X, p, t) PPiPj 
= 3 iq:x dp a a A 
= PD) | a BaN e) Z—IG,b.t Vib; 
À 
= py(t) Efe pa, A) een, 2) 
À 
ap ee 
x J Ty AV GP ĝ, DPiPiPiPk 
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= POY | Ba baD iea) 
Xr 


1 ap rae PERS 2 
if Eara a N(1-@-@) . (66.37 


This is the neutrino contribution to the anisotropic inertia tensor mi. Comp- 
aring this with Eq. (6.6.5) then gives 


Ton _ Pvit) dÊ ry Å nA oe 
my) = Paap ao(1-G-a) 6638 
As for photons, this can be evaluated using the at wave expansion: 
rO = 20 ANG. + AGN + AGO]. 6.6.39) 
But for neutrinos it is easier to use the explicit solution (6.6.36). Using 


Eqs. (6.6.36) and (6.1.55) in the third expression of Eq. (6.6.35) and com- 
paring with Eq. (6.6.5) gives? 


t t tb, 
Ty g(t) = —4py (t) J d'k (a [ a) DES (6.6.40) 
a 
where 
sinv 3cosv 3sinv 
Kejsi aaa a a) (6.6.41) 


We will use this below in calculating the decay of gravitational waves that 
exit from the horizon in the radiation-dominated era. 


We have found a complete set of differential equations for the tensor per- 
turbations. Now we shall turn to the calculation of D,(t), which provides 
the essential input in the calculation of tensor anisotropies in the cosmic 
UCLOWENS background. We begin by neglecting the tensor anisotropic 
inertia me , returning to a consideration of its effects at the end of this 
section. The anisotropic inertia is negligible during most of the history of 
the universe, when the cosmic energy density is dominated by one or more 
perfect fluids, and it is never very large. With this approximation, the field 
equation (6.6.6) governing the Fourier components of the tensor component 
Dj of the metric perturbation is simply 


: 2 
ss a. 
By) + 3D, + LDO =0. (6.6.42) 


38. Weinberg, Phys. Rev. D 69, 023503 (2004) [astro-ph/0306304]. 
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As already mentioned earlier in this section, at very early times when g/a « 
a/a one solution is constant while the other decays, so as long as this era 
lasts sufficiently long we can neglect the decaying solution, and take our 
initial condition that D4 (z) goes to a constant D7 at early times. 

To treat the evolution of D,(t) at later times, 1t is convenient once again 
to change the independent variable from ¢ to y = a/agQ = Pm /PR, where 
agg is the value of the Robertson—Walker scale factor at matter—radiation 
equality. Assuming that the energy density of the universe is governed by 
radiation and non-relativistic matter,4 we can put Equation (6.3.11) in the 
form 


Hggdt _ ydy 
2 SIFr’ 
where Hgo is the expansion rate at matter-radiation equality. Then 
Eq. (6.6.42) becomes 


Dg 
z t 


(6.6.43) 


+K’ D; =0 (6.6.44) 


d? 21 +y) 1\ dD, 
1 
mine dy ( +5) dy 


where « is the dimensionless rescaled wave number (6.5.9): 


w= V24  (4Q/a0)RR _ 19.3(g/ao)[Mpc™'] 
- apogeo oQ Qule 


(6.6.45) 


We also have the initial condition, that Dy > D; for y & 1. As in the case 
of scalar modes, we can find analytic solutions in two extreme cases, for 
k <landk >l. 

Consider first the case x >> 1. In this limit we already have g/a >> åà/a 
at matter—radiation equality, so horizon entry occurs early in the radiation- 
dominated era. Hence for x >> 1 there are two overlapping eras; an era 
when the universe is radiation dominated extending to early times when the 
perturbation is outside the horizon, and an era when the perturbation is 
well inside the horizon extending to the present, in both of which we will be 
able to find analytic solutions. Because these eras overlap, we will be able 
to match the analytic solutions in the era of overlap, and in this way relate 
the gravitational wave amplitude at late times to the initial condition when 
the perturbation is outside the horizon. 

First, in the era when the universe is radiation dominated y « 1, and 


Eq. (6.6.44) becomes 
dD; (2N) dD 
D;=0. 6.6.46 

dy? +(5) dy A ra eo 


4The evolution of the tensor amplitude under more general assumptions is considered by L. A. Boyle 
and P. J. Steinhardt, astro-ph/0512014. 
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The solution which approaches a constant Dj for y > 0 is 


pa pe, (6.6.47) 
KY q 


Next, consider the case of perturbations deep inside the horizon, that 
is, for which g/a >> H. In this case we can find solutions using the WKB 
approximation for a completely arbitrary dependence of a(t) on time, so 
that our results will apply even in the presence of vacuum energy, whether 
or not it is constant. For this purpose, we must put Eq. (6.6.42) into 
a standard form with no first derivative term by introducing a new 
independent variable, 

x= [eo dt, 


so that Eq. (6.6.42) becomes 


dD 
dx? 


We write Dy = A exp(+ig f a* dx), and keep only terms in Eq. (6.6.48) of 
order q? and q. This gives dA/dx = —(A/2a”)da?/dx, so A « 1/a, and the 
WKB solutions for Dy are a~! exp(+ig f a? dx) = a7! exp(tig f a~! dt). 
The factor 1/a gives a gravitational wave energy density that decreases 
as a~*, a factor of a7! representing the redshift of individual gravitons, 
and a factor a7? arising from the dilution of gravitons as the universe 
expands.® 

To find the correct linear combination of the two WKB solutions, we have 
to match them to the solution (6.6.47) in the radiation-dominated era, by 
considering these solutions in the intermediate range where both g/a > H 
and y & 1. In the radiation-dominated era vacuum energy is presumably 
negligible, so here g/aH = k apQHEQ //2aH = Ky, and for k > 1 there 
does exist a range of y for which xy > 1 even though y < 1. In this range 
of y, we have 


+ga D=O. (6.6.48) 


E E A EAA 


— =K 
q a 4 Ha? a 


KY, 


5The standard way of eliminating the first derivative term in Eq. (6.6.42) is to change the dep- 
endent instead of the independent variable. For instance, see V. S. Mukhanov, H. A. Feldman, and 
R. H. Brandenberger, Phys. Rep. 215, 203 (1992); K. Ng and A. Speliotopoulos, Phys. Rev. D 52, 
2112 (1995); and more recently, J. R. Pritchard and M. Kamionkowski, Ann. Phys. 318, 2 (2005). 
The results obtained in this section are much simpler than with this standard method, because by 
changing the independent rather than the dependent variable we encounter no turning point in the 
WKB solution. 

®See G&C, Sec. 15.10. 
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so (recalling that y œ a) the linear combination of a7! exp(+ig f a`! dt) 
and a7! exp(—iq f a~! dt) that matches Eq. (6.6.47) where both are valid is 


RET t dt 7 
Dı = —sin(q | —~x] Di. (6.6.49) 
ky o a(t’) 
To repeat, this is the solution deep in the horizon, whatever the contents of 
the universe. For times near the present, this is 


J QRHe 
Dı = = sin () + k(t — to)) DS, (6.6.50) 
where k = q/ao, and n = kag i dt/a(t). 

It is possible that cosmological gravitational waves might be detected 
directly.’ Tensor modes detectable in this way would certainly have x > 1, 
the case we have been considering, so that they would enter the horizon 
much earlier than the time of recombination, and hence could provide an 
opportunity for a direct observation of the universe at very early times, and 
even for the exploration of physics at higher energies than can be reached by 
conventional particle accelerators. The existing ground-based Laser Inter- 
ferometer Gravitational-Wave Observatory (LIGO) operates at about 100 
Hz; perturbations with this frequency re-entered the horizon when the cos- 
mic temperature was about 108 GeV, but LIGO does not have the sensitiv- 
ity required to detect cosmological gravitational waves. But cosmological 
gravitational waves might be detected by space-borne laser interferometers. 
For instance, two detectors of cosmological gravitational waves have been 
under consideration: the Big Bang Observer in the U.S., and the Deci- 
hertz Laser Interferometer Gravitational Wave Observatory in Japan. Both 
operate at frequencies around 0.01 to 0.1 Hz, and according to Eq. (6.6.43), 
a gravitational wave with a frequency of gc/2ay = 107? Hz would have 
k = 1.3x10/Qyh? > 1. The wavelengths to which such detectors would 
be sensitive would short enough to have come into the horizon when the 
cosmic temperature was 104 to 10° GeV. In this case, the changes in the 
time dependence of a(t) associated with changes in the equation of state of 
matter at various annihilation thresholds produce distinctive features in the 
spectrum of the tensor modes. For reasonable assumptions about the pri- 


7For recent studies, see N. Seto, S. Kawamura, and T. Nakamura, Phys. Rev. Lett. 87, 221103 
(2001) [astro-ph/0108011]; A. Buonanno, gr-qc/0303085; A. Cooray, astro-ph/0503118; T. L. Smith, 
M. Kamionkowski, and A. Cooray, Phys. Rev. D 73, 023504 (2006) [astro-ph/0506422]; G. Efstathiou 
and S. Chongchitnan, Prog. Theor. Phys. Suppl. 163, 204 (2006) [astro-ph/0603118]; B. Friedman, A. 
Cooray, and A. Melchiorri, Phys. Rev. D 74, 123509 (2006) [astro-ph/0610220]. Also see NASA web 
page universe.nasa.gov/program/bbo.html. 

8Y. Watanabe and E. Komatsu, Phys. Rev. D 73, 123515 (2006) [astro-ph/0604176]. 
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mordial tensor spectrum, it seems likely’ that a tensor perturbation strong 
enough to be detected directly at short wavelengths would be detected also 
indirectly at longer wavelengths through its effect on the polarization of the 
cosmic microwave background, to be discussed in Section 7.4. 

In calculating the effect of cosmological gravitational waves on the cos- 
mic microwave background, we need also to consider values of «x of order 
unity or less. Let us therefore turn to the other case that can be treated ana- 
lytically, the case «x « 1. Here we still have g/a « a/a at matter—radiation 
equality, so the perturbation remains outside the horizon and D, (t) remains 
equal to the constant value D? until well into the matter-dominated era, 
when y > 1. In the limit y >> 1, and whatever the value of x, Eq. (6.6.42) 
becomes 


d’Dy 5dD} 3 
ZA 447D, =0. 6.51 
sm ts gy es 0 (6.6.51) 


This has two independent solutions, which can be written as functions 


of k/¥: 
2) (2/9) e — cos (2« v7) sin (2« v7) 


KS k?y + 2x3 y3/2 
E 2nı (2K /Y) 7 sin (2« v7) n COS (2« v7) (6 $ 52) 
kI k?y 2K3y3/2 rene 


In the matter-dominated era, g/aH = kagoHeg/[v2aH = k,/Y, SO we 
must impose the condition that the solution should approach the constant 
value D3 for k /y < 1. In order to satisfy this condition, we must exclude 
the second solution, which becomes singular for «,/y — 0. In this limit 
the first solution approaches the constant 4/3, so the correct solution when 
k X lis 


-cos (2x y7) sin (2e y 
GEA = n 5) T a 


(6.6.53) 


(Note that although Eq. (6.6.51) applies only for y > 1, Eq. (6.6.53) is 
valid for x < 1 and any y, because it correctly gives Dg(t) = Dj as long as 
k./y < 1, which for « < 1 applies until y > 1.) 

The interpolation between the cases k >> 1 and «k < 1 is simplest in 
the matter-dominated era, when y > 1. This is only a fair approximation 
at the time of the decoupling of matter and radiation, when y ~ 3, but it 


9 Smith, Kamionkowski, and Cooray, ref. 7. 
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becomes a good approximation thereafter. In this case the tensor modes are 
governed by Eq. (6.6.51), and the general solution is a linear combination 
of the two solutions (6.6.52), which for any x can conveniently be written 


3 DL UCK) — cos (2« VF + E(x) sin (2« VP + E(x) 
1 4x2 y a 2x y3/? í 
(6.6.54) 


where U and & are real dimensionless functions of «. From Eq. (6.6.53) we 
see that fork <1, 


Uk) —> 1, Bejficr 0, (6.6.55) 


which is why we chose to write the physical solution in the form (6.6.54). 
Also, for y > 1 Eq. (6.6.49) becomes 


sin (2x(/TFy — D) 
= D 


D an 6.6.56 
q Ky q ( ) 
and therefore for k > 1, 
4 
U(K) > = , Ew) > =e (6.6.57) 


The values of U (k) and E(x) for general x must be found by a computer 
calculation of the solution of Eq. (6.6.44). The results!? are presented in 
Table 6.2. It can be seen that the analytic asymptotic limits (6.6.55) and 
(6.6.57) agree quite well with the computer calculation fork « landk > 1, 
respectively. 

Now let us consider the effect of anisotropic inertia.!! The anisotropic 
inertia tensor is the sum of the contributions from photons and neutri- 
nos, but photons are have a short mean free time before the era of 
recombination, and make only a small contribution to the total energy den- 
sity afterwards, so their contribution to the anisotropic inertia is small. 
This leaves neutrinos (including antineutrinos), which have been travel- 
ling essentially without collisions!” since the temperature dropped below 


10D, Dicus, private communication. 

lig, Weinberg, ref. 3. For results of an earlier computer calculation, see J. R. Bond, in Cosmology and 
Large Scale Structure, Les Houches Session LX, eds. R. Schaeffer, J. Silk, and J. Zinn-Justin (Elsevier, 
Amsterdam, 1996). 

12For very short wavelengths that entered the horizon before the time of electron—positron annihila- 
tion, it is necessary to take into account collisions of neutrinos with each other and with electrons and 
positrons. This is considered by M. Lattanzi and G. Montani, Mod. Phys. Lett. A 20, 2607 (2005) 
[astro-ph/0508364]. 
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Table 6.2: The Tensor Transfer Functions 


K Uu E K Uu E 

0.001 | 1.0000 —2.991 x 107!! 1.8 | 2.990 —2.418 
0.002 | 1.0000 —9.557 x 107!° 2.0 | 3.250 —2.781 
0.003 | 1.0000 —7.239 x 107°? 2.5 | 3.906  —3.706 
0.004 | 1.0000 —9.236 x 10-8 3.0 | 4.565  —4.646 
0.005 | 1.0001 —9.236 x 1078 3.5 | 5.222 —5.597 
0.006 | 1.0001 —2.286 x 10-7 4.0 | 5.886  —6.555 
0.007 | 1.0002 —4.908 x 10-7 4.5) 6.551 —7.518 


0.008 | 1.0002 —9.498 x 1077 
0.009 | 1.0003 —1.697 x 1076 
0.01 | 1.0004 —2.847 x 1076 
0.02 | 1.0016 —7.848 x 1075 


0.03 | 1.0037 —4.655 x 1074 75 | 10538 —13347 
0.04 | 1.0065 —1.397 x 1073 80 | 11201 -14324 
0.05 | 1.0095 —2.809 x 1073 85 111.869 —15.302 
0.06 | 1.0128 —4.397 x 1073 90 | 12.538 —16.279 
0.07 1.0165 —6.023 x 1073 10.0 | 13.863 —18.237 
0.08 | 1.0210 —7.929 x 1073 11.0 | 15.202 —20.196 
0.09 | 1.0260 —1.044 x 1072 12.0 | 16.526 —22.157 
0.1 1.0310 —1.352 x 1072 13.0 | 17.867 —24.119 
0.2 1.0960 —5.802 x 107? 14.0 | 19.190 —26.081 
0.3 1.1800 —0.1293 15.0 | 20.532 —28.044 
0.4 1.2766 —0.2223 16.0 | 21.854 —30.001 
0.5 1.3816 —0.3327 17.0 | 23.197 —31.973 
0.6 1.4926 —0.4568 18.0 | 24.52  —33.94 
0.7 1.6079 —0.5919 19.0 | 25.86 —35.90 
0.8 1.7265 —0.7357 20.0 | 27.18 —37.87 
0.9 1.848  —0.8866 21.0 | 28.53 —39.83 
1.0 1.970  —1.043 22.0 | 29.85  —41.80 
1.1 2.095 —1.205 23.0 | 31.19  —43.77 
1.2 12.220 —1.370 24.0 | 32.51 —45.73 
1.4 |2.474 —1.171 25.0 | 33.86  —47.70 


1.6 2.731  —2.061 


about 10!° K, and which make up a good fraction of the energy density of 
the universe until cold dark matter becomes important, at a temperature 
about 104 K. The tensor part of the anisotropic inertia tensor is given by 
Eq. (6.6.40 ), so the gravitational wave equation (6.6.6) now becomes an 
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integro-differential equation!? 


A 


; 2 t t 
POTD OF SD, (0) = 646 p.(0) | x(a Í L 
a a 0 


a) D phe ae, 


(6.6.58) 


with K (v) given by Eq. (6.6.41). Note that despite the presence of anisotropic 
inertia, for g/a & H this has a solution with D,(¢) time-independent, in 
accordance with the general theorem of Section 5.4. 

This wave equation becomes particularly simple for wavelengths short 
enough to enter the horizon during the radiation-dominated era (though 
well after et—e~ annihilation), that is, for « >> 1. We will define the zero 
of time so that in this era a œ /t. It is convenient now to write Dg as a 


function of the variable 
t dt 2qt 
= = 6.6.59 
s af a) al)” ee 


instead of t. Using the Friedmann equation 87 Gp/3 = H? = 1/427, the 
gravitational wave equation (6.6.58) in the radiation-dominated era becomes 


d? 2d e iae poty 


— Dy(u) + ee ha + Dglu) = 
er 


py 3 7/8) - (4/11)*3 


ive B+, 1+3-(7/8)- (4/1147 


= 0.4052 . (6.6.61) 


Late in the radiation-dominated era, the factor 1/u* makes the right-hand 
side negligible, so D,(u) approaches a solution of the homogeneous equa- 
tion. In general, one might expect a linear combination of sin u/u and 
cos u/u, but in fact no cos u/u term appears in the solution.'* A numerical 
solution of Eq. (6.6.60) shows that if Dg(u) takes the value Dj for u < 1 
then for u >> 1 (but still in the radiation-dominated era) 


D,(u) > Dea mw) (6.6.62) 


13The lower bound on this integral should in principle be taken as the time of neutrino decoupling, 
at a temperature of about 10! K, but it is a good approximation to take it at a time tf = 0, defined by 
writing the scale factor during the radiation-dominated era as a œ 11/2, 

14D, Dicus and W. Repko, Phys. Rev. 72, 088302(2005) [astro-ph/0509096] have shown analytically 
that no cos u/u term appears in this solution, and have given an analytic solution as a rapidly convergent 
sum of spherical Bessel functions. The absence of a cos u/u term was shown very generally on causality 
grounds by S. Bashinsky, astro-ph/0505502. 
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where a = 0.8026. This then serves instead of Eq. (6.6.47) as an initial 
condition for the subsequent evolution of the gravitational wave amplitude, 
so that later, during the matter-dominated era, the effect of damping on the 
amplitude (neglecting 5) is simply to multiply the result given in Eq. (6.6.54) 
by a factor a.> 

The damping of gravitational waves of longer wavelength is considerably 
morecomplicated. Because ofa shift in the phase of the oscillation, the effect 
of anisotropic inertia on the amplitude of gravitational waves at the time 
of decoupling is a sensitive function of wave number, and for some wave 
numbers can even be an enhancement instead of a damping, but typically 
the amplitude is damped for « = O(1) by roughly 5%. 

Because the damping effect is small anyway for «x « 1, it will be an 
adequate approximation for all wavelengths to take the gravitational wave 
amplitude in the matter dominated era to be given by multiplying the result 
(6.6.54) with a factor a(x): 


A 3 DE Ulak) — cos (2 VF +6 («)) sin (2« VP + E(k) 
a 4K? y < 2x y3/2 í 
(6.6.63) 


with a(«) some function of « that rises smoothly from a(x) = 0.8026 for 
k > ltog(k) ~ 1 for« « 1. For instance, we can take (k) ~ (1 + 
.8026x)/(1 + K). All observable effects of cosmological gravitational waves 
will be reduced by this factor a(x). 


'SThe effect of possible neutrino masses and/or chemical potentials is considered by K. Ichiki, 
M. Yamaguchi, and J. Yokayama, Pub. Astron. Soc. Pacific 119, 30 (2007) [hep-ph/061 1121). 
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Anisotropies in the Microwave Sky 


We will now return to the theory of anisotropies in the cosmic microwave 
background, introduced in Section 2.6. In Section 7.1 we derive gen- 
eral formulas for the observed temperature fluctuation. Then in Sections 
7.2 and 7.3 we combine these results with the analysis of cosmic evolu- 
tion in Chapter 6 and introduce a series of approximations that simplify 
the evaluation of the multipole coefficients for scalar and tensor modes, 
respectively. Section 7.4 deals with the polarization of the microwave 
background. 


7.1 General formulas for the temperature fluctuation 


In this section we will derive general formulas for the contribution of scalar 
and tensor modes to the observed temperature fluctuation. We begin by car- 
rying the solution of the Boltzmann equations in synchronous gauge given 
in Section 6.1 forward to the present. When implemented with computer 
programs such as CMBfast or CAMB, the approach provides numerical 
results of great accuracy, but neither the derivation of the formulas for tem- 
perature fluctuations nor the computer programs are physically very trans- 
parent. We will then show how these results can be simplified by making the 
approximation of a sharp transition from thermal equilibrium to complete 
transparency at amoment tz of last scattering. This ignores the scattering of 
photons by matter that becomes reionized at a redshift of order 10. For tem- 
perature correlations, the corrections due to reionization are very simple for 
multipole orders £ greater than about 20, and will be included in Section 7.2. 
In Section 7.2 we will also partly make up for the approximation of a sharp 
drop from opacity to transparency by including effects of viscous damping 
during this transition, and by including effects of averaging over tz. (At the 
end of this section we will show how the same simplified results can also be 
obtained in a more general gauge by following photon trajectories from tz 
to the present, with no need to use the Boltzmann equation.) The results of 
the sudden-decoupling approximation obtained here will be used together 
with other approximations in Section 7.2 to derive analytic expressions for 
the temperature multipole coefficients, that require computer calculations 
only to carry out a single numerical integration. 

Because the proper energy density of black body radiation is proportional 
to the fourth power of the temperature, the fractional perturbation in 
temperature of radiation coming from a direction 7 is one-fourth the 
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fractional perturbation in the proper energy density of photons travelling 
in direction p = —f at our position x = 0 and time t = tọ. Eqs. (6.1.13) or 
(6.6.8) thus give 


AT(n 1 7 
= ra zis = 0,—f,t0), (7.1.1) 


where J;(x, p, t) is the fractional density matrix, defined by Eqs. (6.1.13) 
for scalar modes and (6.6.8) for tensor modes. Using the decompositions 
(6.1.18), (6.1.20) for scalar modes or (6.6.9), (6.6.13) for tensor modes, the 
scalar and tensor contributions to the temperature fluctuation are 


AT@\ 1 Na 
( 7 ’) B i | ea alg) AP (q, -Â Â, to) , (7.1.2) 
and 
AT@\P 1 ee N 
( a ’) Z 4 2 fès Pa, A) neni eki (d, YAŞ (q, —q4:n, to) > 


A=+2 
(7.1.3) 


where a(q) and £(q,à) are the stochastic parameters for whatever 
modes are assumed to dominate the scalar and tensor perturbations, respec- 
tively; exı(ĝ, A) is the polarization tensor defined in Section 6.6 for a grav- 
itational wave with wave number q and helicity A; and Am (q, u, t) and 
AR (q, u, t) are amplitudes appearing in the decompositions (6.1.20) and 
(6.6.13) of the scalar and tensor contributions to the fractional density 
matrix Jj. 

In order to display the angular dependence of the temperature shift, 
we use the line-of-sight integrals (6.1.37) and (6.1.38) for scalar modes 
and (6.6.25) and (6.6.26) for tensor modes, which give, for the scalar 
modes 


AT()\® 1 
(TR) maf Pea 


to to dt to 
<f dt exp E . af — -f dt’ wat) 
t t a(t’) t 


«| — 2Ag(t) + 2q A)? By(t) + 3a-( Og, 1) 


—4i (q + ft) W(t) dupg(t)/a(t) 


3 og 
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and for the tensor modes 


ATAN D 
( 2) - = D fe B(q, A) NEN] eKI(G, à) 


19 PED 
t i t 
f" dt exp jaa f al = “a wat) 
ti © a(t’) t 
«| — 2D, (t) + w(t) Yq, J : (7.1.5) 


As a reminder: tı is any time sufficiently early before recombination so that 
any photon present then would have been scattered many times before the 
present; œc(t) is the photon collision rate at time t; A,(t) and B4(t) are 
the scalar fields in the perturbation to the metric in synchronous gauge, 
defined in Section 5.2; ®(q, t) and TI (q, t) are the scalar source functions, 
defined by Eq. (6.1.21); 5ug,(t) is the scalar velocity potential of the bary- 
onic plasma; D4(t) is the gravitational wave amplitude, defined in Section 
5.2; and Y (q, t) is the tensor source function, defined by Eqs. (6.6.11) and 
(6.6.12). 

Eq. (7.1.4) does not give the expression for the scalar temperature fluc- 
tuation in its most convenient form. When we pass to the limit of a sharp 
moment of last scattering, its terms will not correspond to the decompo- 
sition of the fluctuation into Sachs-Wolfe, Doppler, intrinsic, and inte- 
grated Sachs—Wolfe terms, discussed in Section 2.6. The individual terms 
in Eq. (7.1.4) are not even invariant under the limited class of gauge trans- 
formations (5.3.39)-(5.3.42) that preserve the conditions for synchronous 
gauge. We will therefore rewrite Eq. (7.1.4) by using the identity 


t 
exp (ia: a a) (qf) By(t) = 


t t 
— exp (ia = | : oa (e OBO + aå) Bw) 


+ {eso (aa f2) 
dt Pa 1 a(t’) 


x [eoo + a(t)a(t)Bg(t) + ia(nq - AB, (0) | | (7.1.6) 


Using this in Eq. (7.1.4) and integrating by parts, yields our final formula 
for the scalar temperature fluctuation 


ean _ Cy x Cay FLN 


To To early To ISW 
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where the first, “early,” term receives contributions only from times with 
an appreciable free electron density, before recombination and after 


reionization, 
AT(Aay\“ 
( 7 ) = | qaw 
0 


early 


to to dt! to 
<f dt exp aaf Saas -f dt’ wt) 
ti © a(t’) t 


3 3 fe 
xo] 320; N+ 2(1-@-m) NG. 


1 i 1 bas 
—5 8 (OBO — saan By 
~i(q-A) (ura + ainb,(0)/2) ; (7.1.8) 


and the second, “integrated Sachs—Wolfe term,” receives contributions from 
the whole period from f¢; to the present, 


oe 1 ; to 
= —- | d-qa( f dt 
( To ISW 2 ROS ti 


to / to 
x exp E . af a2 J dt’ wt) 
© a(t’) t 


x T [40 +a? (t)By(t) + ana, | . (7.1.9) 


(Here we are ignoring a surface term in the integration by parts at t = tọ, 
because it is linear in 7, and therefore just contributes to the £ = 0 and £ = 1 
partial waves. This “late” term is calculated by a different method later in 
this section, and given in Eq.(7.1.38).) 

The integrated Sachs—Wolfe term (7.1.9) represents the effect of chang- 
ing gravitational fields during the passage of the microwave photons from 
last scattering to the present. As already noted in Section 2.6, the ISW term 
would vanish if the gravitational field from last scattering to the present (or 
more precisely, at times ¢ when the transparency exp (- f o welt’ )dt') was 
non-negligible) arose solely from the density of cold matter. (This feature 
provides another reason, apart from gauge invariance, for the rearrangement 
of terms that led to Eq. (7.1.7).) Under this approximation, Eqs. (6.5.15) 
and (6.5.16) show that 5pg œ 17/3 and wg « t13. Eq. (5.3.38) gives 
q’ Ag = Bra? ppSpq — 2Ha* Yq, both terms of which are constant in the 
matter-dominated era, so Aq does not contribute to Eq. (7.1.9). With Aq 
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negligible, Yg = —q7By, so By œ t~'/3, and therefore both terms a? B; and 
aiBy in Eq. (7.1.9) are time-independent, and therefore do not contribute 
to the ISW effect either. For this reason the ISW term (7.1.9) is relatively 
small. The early-time ISW effect depends sensitively on the ratio of matter 
to radiation at last scattering, which is an aid in using observations of tem- 
perature fluctuations to measure Qyh?. The late-time ISW effect receives 
its main contribution from times near the present, when the dark matter 
density falls below the vacuum energy density. Anisotropies subtend larger 
angles when viewed nearby than from great distances, so the late-time ISW 
term in the temperature anisotropy contributes to the temperature multipole 
coefficients Ce only for relatively small £, say £ < 20. It is the integrated 
Sachs—Wolfe effect that causes the predicted values of €(€ + 1)C¢ to rise 
as £ drops from around 20 to smaller values. This effect has been difficult 
to see in present data on the cosmic microwave background temperature 
fluctuations. ! 

It is the “early” term (7.1.8) that makes the largest contribution to the 
scalar temperature fluctuation for £ > 20. The terms in Eq. (7.1.8) pro- 
portional to q - Ê represent the Doppler effect,” while the other terms give 
the combined effect of gravitational time dilation and intrinsic temperature 
fluctuations. 

Let’s pause to check invariance under the limited set of gauge trans- 
formations that preserve the conditions for synchronous gauge. These trans- 
formations induce the changes Ab; = —21/@ and Adugy = T, Where t is 
an arbitrary function of x, so the Doppler term proportional to iq - Î in 


'The ISW effect can be detected through its correlation with inhomogeneities in the distribution 
of matter (which are also linear in a(q)), as suggested by R. G. Crittenden and N. Turok, Phys. Rev. 
Lett. 76, 575 (1996) [astro-ph/9510072]. This effect has been seen in the correlation of data from the 
WMAP satellite (discussed in the next section) with various surveys, by P. Fosalba and E. Gaztañaga, 
Mon. Not. Roy. Astron. Soc. 350, L37 (2004) [astro-ph/0305468]; P. Fosalba, E. Gaztañaga, and 
F. Castander, Astrophys. J. 597, L89 (2003) [astro-ph/0307249]; N. Ashfordi, Y-S. Loh, and M. A. 
Strauss, astro-ph/0308260; S. P. Boughn and R. G. Crittenden, New Astron. Rev. 49, 75 (2005) [astro- 
ph/0404470]; N. Padmanabhan, C. M. Hirata, U. Seljak, D. Schlegel, J. Brinkmann and D. P. Schneider, 
Phys. Rev. D 72, 043525 (2005) [astro-ph/0410360]. More recent cross-correlations of anisotropies seen 
by the three-year Wilkinson Microwave Anisotropy Probe (discussed in the next section) with galaxies 
in the Sloan Digital Sky Survey and with radio galaxy data from the NRAO VLA Sky Survey give 
0.7 < Qy < 0.82 and 0.3 < Qy < 0.8 at a 95% confidence level, respectively; see A. Cabré et al., 
astro-ph/0603690; D. Pietrobon, A. Balbi, and D. Marinucci, Phys. Rev. D 74, 043524 (2006) [astro- 
ph/0606475]. Unfortunately, cosmic variance limits the accuracy with which this approach can be used 
to study the time dependence of the vacuum energy. 

2Ina gauge in which cold dark matter remains at rest, the effect of gravitational perturbations on 
the cold dark matter particles is canceled by the definition of surfaces of equal time, so in this gauge the 
velocity perturbations of the baryonic plasma arise solely from pressure forces, not from gravitational 
forces. This is why the gravitational term proportional to By appears accompanying the plasma velocity 
potential in Eq. (7.1.8); it represents the velocity that in a different gauge would be given to the photon— 
baryon plasma by gravitational forces. 
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Eq. (7.1.8) is separately gauge invariant. Also, A® = —(4/3)Tt/T = 
(4/3)at/a and ATI = 0, so the other terms in Eq. (7.1.8) are also gauge 
invariant; and AA = —2dat/a, so also Eq. (7.1.9) is invariant under this 
limited set of gauge transformations. 

Our results so far are exact, aside from the use of first-order perturbation 
theory and the assumption that photons interact only through purely elastic 
Thomson scattering. (In Eqs. (7.1.2) and (7.1.3) we are assuming that the 
scalar and tensor fluctuations are each dominated by a single mode, but it 
would be trivial to introduce a sum over modes in these expressions.) These 
results are equivalent to those given by Seljak and Zaldarriaga,* which are 
used in computer programs like CMBfast and CAMB. At the cost of only 
a small loss of numerical accuracy, they can be greatly simplified if we now 
make the approximation of a sharp transition from thermal equilibrium to 
perfect transparency at a definite time tr. 

The integrand of the “early” contribution (7.1.8) to the scalar tempe- 
rature perturbation contains a factor P(t) equal to the probability distribu- 
tion of the last photon scattering 


to 
P(t) = we(t) exp (- | dt’ ot) ; (7.1.10) 
t 
Assuming that w,(t) drops sharply at time tz from a value much greater than 


the expansion rate to zero, the function P(t) is non-zero only in a narrow 
interval around tz. But P(t) is a normalized probability distribution: 


t 
[ roast (7.1.11) 
ti 


so the integral over t in Eq. (7.1.8) can be evaluated by dropping the factor 
P(t) and setting t = tr: 


os J j [ af” a] 
x | dqa exp |iq-i — 
( Ti.) cae q a(q) exp | iq at) 


3 3 ee 
«| Foun) + (1 - Q-A) I (q, tL) 


3This follows from the rule (5.3.42), that the change Ads in the perturbation ôs to a scalar s with 
unperturbed value 5 is As = —St. In using this rule, we note that the unperturbed photon distribution 
is isotropic and unpolarized, so that, of the terms in Eqs. (6.1.29) and (6.1.30) for ® and TI, the only 
one that has an unperturbed value is ayy /3. For the purposes of assessing the gauge transformation 


property of this term, we must define the unperturbed value of Ay so that its perturbation is the 


fractional photon density fluctuation 48T /T, and so its unperturbed value is 41n T. 
4U, Seljak and M. Zaldarriaga, Astrophys. J. 459, 437 (1996) [astro-ph/9603033]. 
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1, + 1 p : 
Z3% (OBL) — alin )at,) Bq (tL) 


—iq- )(Sug(tr)/aCte) + attr) Bg(tr) ?2)| 7.142) 


Under the same assumption of a rapid drop in w,(t) from a large value for 
t < ty toa negligible value for t > tz, the factor exp (- i dt’ welt’) in 


Eq. (7.1.9) rises sharply from zero for t < tz to unity for t > tz, so Eq (7.1.9) 
becomes 


oe f 3 f" [ if” a] 
x—- Id dt exp | iq- ñ an 
( To Jisw 5 q a(q) a p | iq U 


x Tfno +a? (t)Bg(t) + ana B, (0 | (7.1.13) 


The same approximation applied to the tensor contribution (7.1.5) gives 


ATA)\™ 1 
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To ASEZ 
2 f" abao ( f a) 
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+Y (q, tL) exp (ia . af =a) i (7.1.14) 
tL a(t ) 
Further, in local thermal equilibrium photons are unpolarized and have an 


isotropic momentum distribution, so AO AS , and KO vanish for all 


£, and Am vanishes for all £ except £ = 0, so that the formulas (6.1.29), 
(6.1.30), and (6.6.21) for the source functions give I = YW = 0 and ® = 
AS) 73 = 48T /3T in local thermal equilibrium. The assumption of a 
sharp drop from a very high to a very low photon collision frequency then 


5This can be seen formally by taking the limit we —> oo in the Boltzmann equations (6.1.27), (6.1.28), 
(6.6.18), and (6.6.19). This gives 


aS) = Ay +8 aS — (3044) 494108 
Pe = zo + 02) > TED LIP TS co + 5 Mde2 , 


T T 
AFL = SoY, AL? = sot. 


The formulas (6.1.29), (6.1.30), and (6.6.21) for the source functions then read ® = ®—J1/12, M = 3117/5, 
and YW = 7/10, which require that TI = ¥ = 0 and d = A: 
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allows Eqs. (7.1.12) and (7.1.14) to be further simplified to 


ror | 3 [ af" A 
x | dqa exp | iq: -ñ — 
( Tory ais q a(q) exp | iq 1 at) 


x KO +iG- MGA) ; (7.1.15) 


where F(q) and G(q) are the form factors 


6Ty(tr) 1 Fe 1 : ; 
FO = Fi T POBUD - sar alr Bglt) (1119 
Gq) = —4(5uya(t1)/altr) + altr)bqlt1)/2) » (7.1.17) 


and 


AT@\ "1 
( 2) ==5 Ð | da baD Aut end. 


To SEZ 
f'a 7 to dt! 
x t Dg(t) ex ( af a) ; 7.1.18 
eas all at 
x k k 


We will now make the same approximation, of a sudden drop in opacity 
at tz, and use it to derive formulas for the scalar and tensor temperature 
fluctuations by following photon trajectories, without needing to use the 
Boltzmann equation formalism of Section 6.1. Because it is easy, we will 
carry out this derivation in a more general class of gauges. After this deriva- 
tion, we will check that it yields the results (7.1.15) and (7.1.13) for scalar 
fluctuations in synchronous gauge, and (7.1.18) for tensor fluctuations. The 
reader who is comfortable with the derivation of these formulas using the 
Boltzmann equation in synchronous gauge may want to skip the rest of this 
section. 

We start with some general remarks, that apply equally to scalar and 
tensor perturbations, and that for scalar perturbations apply in any gauge 
in which gj9 = 0, including both Newtonian and synchronous gauge. Con- 
tinuing to neglect a possible unperturbed spatial curvature, we write the 
perturbed metric in any gauge with g;ọ = 0 in the form 


g0 =-1- EQ, N, go=0, gysa(Odjt+hy(x.t. (7.1.19) 


A light ray travelling toward the center of the Robertson—Walker coordinate 
system from the direction 7 will have a co-moving radial coordinate r related 
to t by 


0 = gudx" ae = -(1 + E(rn, ))de + (ew + hr (rÀ, 1) dr , (7.1.20) 
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or in other words 


-1/2 
dr (S) z 1- hy E 


rF 


-+~ —. 7.1.21 
1+E rare 2a ( ) 


(To first order in perturbations we don’t have to worry about a deflection 
of the ray from the radial direction, because 2,4 = 2,4 = 0, so that any 
deflection would have to be of first order, and its effect in the term /jjdx'dx/ 
would therefore be of second order.) 

We now make the approximation that the transition of cosmic mat- 
ter from opacity to transparency occurred suddenly at a time tz of last 
scattering, at a red shift 1 + zz ~ 1090. With this approximation, the 
relevant first-order solution of Eq. (7.1.21) is 


t dt! 
t) = s(t ——N(s(t’)n, t') , 7.1.22 
r(t) sos f a (s€ )ñ, ) ( ) 
where ThA 
rr(X, t 
=- — E .1.2 
Newt) = 5 [EO — Bex] (7.1.23) 
and s(t) is the zeroth order solution for the radial coordinate which has the 
value rz at t = tz: 
t dt to dt 
t) =rL— = ; 7.1.24 
or J, a(t’) j a(t’) vee 


In particular, if the ray reaches r = 0 at a time fo, then Eq. (7.1.22) gives 
dt dt 
0= s(t —N(s(t)n,t) = — (N(s(Hn,t)—1). 
sc) +f a (s€ n ) rot ff man (s€ Jñ ) ) 
(7.1.25) 


A time interval ôtz between the departure of successive light wave crests 
at the time tz, of last scattering produces a time interval tọ between arrival 
of successive crests at tọ given by the variation of Eq. (7.1.25): 


oty 5 n dt (ae) 
0= + |1 N(rri t) +| — (21 
a(tz) l (r 1) u a(t) BF Jae 
n ôto 
r aa a — 
+511 bu (rit, tL) +- el 1+N(O,t0)]. 


(7.1.26) 


(The term on the right-hand side involving the radial velocity dw; of the 
photon gas or photon-electron—nucleon fluid arises from the change with 
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time of the radial coordinate rz of the light source in Eq. (7.1.25). We 
don’t consider the variation of the argument s(t)n in N, because to zeroth 
order rz and tz are related in such a way that s(t) = 0 for all rz, so its 
variation with rz is of first order, and the effect of this variation on N would 
be of second order.) The total rate of change of the quantity N(s(t)n, t) in 
Eq. (7.1.25) is 


d 2 a E 1 ƏN (rÀ, t) 
SN (sii. r) ` (Noh, r a (ame a = ; 


so Eq. (7.1.26) may be written 


ot f | ð x | 
0= 1 — N (0, to) + dt t —N (rì, t 
a(t) l i g tL ot i ) r=s(t) 


+ôtr ul, (rrû, t1) + aj- 1+N(, 10)] . (7.1.27) 


This gives the ratio of the coordinate time interval between crests when 
emitted and received, but what we want is the ratio of the proper time 
intervals 


ÔTL = VI+E(ry, tr) tL, dst = y 1 + E(0, to) dto , (7.1.28) 


which to first order gives the ratio of the received and emitted frequencies 
as 


vo _ TL _ atx) 1 . 
VL ÔTO a(to) | z 2 ( (rN, tL) (0, t0)) 


to a 
— J | —N (rì, n| dt 
tL ot r=s(t) 


— a(t) ôu, (ra, J ) (7.1.29) 


The temperature observed at the present time #9 coming from direction 7 
is then 


T = (vo/vz)(Ttr) + 87h, t1)) 


where now we have added a term 57 to take account of the intrinsic 
temperature fluctuation at time tz. Likewise, in the absence of perturbations 
the temperature observed in all directions would be 


= (acr) /a(w)) T (tz) , 
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so the fractional shift in the radiation temperature observed coming from 
direction Î from its unperturbed value is® 


AT(n) = T) — To _ vo 7 oT (rn, tL) 
To To ~~ a(tt)vL/a(to) T (tr) 
1 7 to ð A 
= 5(£trzi, tr) - EO,10)) - i. di {5 (ri D 
oT (rn, tL) 


— a(t) u, (ria, tL) + (7.1.30) 


T (tz) 
Because tensor and scalar perturbations are uncorrelated, we will treat 
their contribution to Ce independently. 
For scalar perturbations in any gauge with hj =0, the metric 
perturbation is given by Eqs. (5.1.31)-(5.1.33) as 


hoo = —E hy = Aé;; + —— (7 1.31) 
n 3 Nij a jj TARE l. 
o 4 ý Ox! dx! 


Also for scalar perturbations the radial photon fluid velocity is given in 
terms of a velocity potential du, as 


grudu _ 1 duy 


ôu! = = 
Y axe &@ ðr 


(7.1.32) 


Thus Eq. (7.1.30) gives the scalar contribution to the temperature 
fluctuation 


(=e) = 5 (Eth, t1) — EO, 1)) — J E | SN (ri ‘)} 


tL r=s(t) 

1 a6 n, t dT (rn, t 

E ( iy (rn, tL) ) qo (7.1.33) 
a(tz) or an T (tz) 

where now 
1 3? B 
N=-|4+— —E]l. 7.1.34 
2 l t ər? | ( ) 


Eq. (7.1.33) is not in the form that is most useful for our purposes. The 
total temperature fluctuation given by Eq. (7.1.33) is invariant with respect 
to the limited class of gauge transformations that leave g;ọ = 0, but this is 
not true of its individual terms, including even the integral of (0N/0t);=s(1). 


6This result is essentially that first found by R. K. Sachs and A. M. Wolfe, Astrophys. J. 147, 73 
(1967). 
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It will be much more convenient to rewrite Eq. (7.1.33) in a way that leaves 
the integral term separately gauge invariant. 
For this purpose, we make use of an identity corresponding to Eq. (7.1.6): 


02 B(rh, t) 
ar 
r=s(t) 


= ae [ota t) + a(t)a(t) Bra, t) + aoto] | 
r=s(t) 


n | “ (a OODE TOOGA »)| 


r=s(t) 


Together with Eq. (7.1.34), this gives the integrand in Eq. (7.1.33) as 


ð K 
| ZN (râ, r) loss 


ee [ents 1) taaah, ) +a) eee? 2| | 
r=s(t) 


2 dt or 
1f{9a 2 $ 
+ z HG (t)B(rn, t) + a(t)a(t)B(rn, t) 
+A(rii, t) — EÂ, »)| (7.1.35) 
r=s(t) 


The scalar fractional temperature fluctuation (7.1.33) may therefore be 
written 


AT(Aa)\* (Arey (ane) (5e) 
= , (7.1.36 
( To ) To early : To late į To ISW : 


where 


AT(Aa)\* 1 .. 1 , 
( 2) = ~<a" (ty) B(rza, tL) — -altr )åltL)B (LÂ, tL) 
To early 2 2 


dT (rn, tL) 
T (tr) 
ð z 
—a(tr) E G Bra, tt) + = Bap Nn w))| ; 


r=ry 


(7.1.37) 


1 R 
+ zE CI, tL) + 
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a\\ § 
(=>) = 5 (to) BO, to) + Latojat BO, to) 
late 


l 3 1. E buy (rn, to) 
—5E(,t0) + alto) E (zo, Di a) o 
(7.1.38) 


ANS 
(A) = =f a|? (e (1) Bra, t) + a(t)a(t) Bra, t) 
T ISW 2 tL 

+A(rh, t) — E(rh, »)| ; (7.1.39) 


r=s(t) 


The “late” term (7.1.38) (which was ignored earlier in this section) is the 
sum of a direction-independent term and a term proportional to, which has 
been added to represent the anisotropy due to the local cosmic gravitational 
field and velocity. It only affects terms in the multipole expansion of the 
temperature correlation function with £ = 0 and £ = 1, so it can be ignored 
if from now on we consider only multipole orders £ > 2. 

Now let us check the separate gauge invariance of the ISW, Doppler, 
and the remaining combined terms, at least for the limited class of gauge 
transformations that preserve the condition g;ọ = 0, or in the notation of 
Eq. (5.1.32), F = 0. Eq. (5.3.13) tells us that these gauge transformations 


have 
ə 
Ly 
D at (5 s) i 


and so AB = 2eọ/a?. According to Eqs. (5.3.13) and (5.3.16), a gauge 
transformation therefore shifts the terms —a?B/2, —aaB/2, E/2, and 8T /T 
in the combined gravitational and intrinsic temperature fluctuations by the 
amounts —a*d(e9/a’)/dt, —deq/a, ĉo, and ceoT/T = —eoå/a, giving no 
net change; it shifts the terms aB/2 and du,/a in the Doppler contribu- 
tion by €o/a and —€0/a, giving no net change; and it shifts the four terms 
a&@ B, 2aaB, A, and —E in the integrand of the integrated Sachs—Wolfe 
term by the amounts 2é9 — 4¢€9a/a, 2eoå/a, 2eoå/a, and —2€o, respec- 
tively, giving no net change. Thus the integrated Sachs—Wolfe term, the 
Doppler term, and the combined gravitational and intrinsic temperature 
terms are separately gauge invariant. In particular, both Newtonian and 
synchronous gauge will give the same results for each of these three 
contributions. 
Let us now assume that from last scattering to the present the scalar 
contributions to the fluctuations are dominated by a single mode, so that 


341 


7  Anisotropies in the Microwave Sky 


any perturbation X (x, t) (such as B, E, du,, or ôT) can be written as 


X(x,f) = [ aew eX, (0) (7.1.40) 
with a(q) a stochastic variable (the same for all X), normalized so that 
(atq) @*(q’)) = 8° (q—q’). (7.1.41) 
Then Eqs. (7.1.37) and (7.1.39) give 
AS 
(7) = J Bg alq) et) (F@ +ig-i G@) (7.1.42) 
To early 
~i 1 ik J 3 iq-fis(1) A Dore 
= —- dt |d qas) — Balt 
( To ie Die. qa(q) e i a(t) Bg(t) 
+ a(t a(n By) + Ag) — BO) (7.1.43) 
where 
1 E 1 ads. te 1 8T (tL) 
F(q) = -5P (1) By (tn) ~ sa(tn)a(te) atr) + 5 Eg (te) + aS 
(7.1.44) 
1 ; 
Gq) = -q G a(ty) Bg (tL) te alt) uyl) (7.1.45) 


As we have seen, the form factors F(q) and G(q) and the integrand of the 
ISW term are separately invariant under gauge transformations that leave 
gio equal to zero. 

In synchronous gauge E, = 0, so Eqs. (7.1.42)(7.1.45) are the same as 
the previously derived results (7.1.15), (7.1.13), (7.1.17), and (7.1.18). In 
Newtonian gauge B = 0 and E = 29, so the form factors are 


ve 
F@ = 9,2) + 2 | (7.1.46) 
T(t) 
1 
EEE T A (7.1.47) 


~ a(t) 


(Of course, 5T, and ôuyq are different in Newtonian gauge from what they 
are in synchronous gauge.) Also, the integrated Sachs—Wolfe term is 


A t ~ . 
(a) = i ‘dt J dqa) Ob. (7.1.48) 
ISW tL 


To 


It is left as an exercise for the reader to show that ®, is constant during 
the matter dominated era. Therefore, as we saw earlier in synchronous 
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gauge, the integrated Sachs—Wolfe effect receives contributions only from 
departures from strict matter dominance. 

Finally, we consider the tensor contribution to temperature fluctuations. 
In the approximation of a sudden transition from thermal equilibrium to 
transparency at time tz, the only contribution of tensor perturbations to the 
observed temperature anisotropy comes from the term h, /2a’ in the defi- 
nition (7.1.23) of N, which according to Eq. (5.1.33) contains a term D,,/2. 
Using this in Eq. (7.1.29) gives the tensor contribution to the temperature 
fluctuation 


ATANT sty (OJ A 
( 2) _ Ah { = py«ri. | J (7.1.49) 
To 2 Ja (2t r=s() 


The gravitational wave amplitude Dj,(x, t) can be expressed as a Fourier 
integral (5.2.21) 


Dy, = >- J dq eta ej, XBA, A) DIO) , (7.1.50) 


A=t2 
where D,(t) is the dominant solution of the wave equation (5.2.16): 
Aa a. g T 
Dat) + 3-Dg(t) + [Dy = lór G r; (t), (7.1.51) 
a a 
ej(A,q) are polarization tensors defined in Section 5.2, and (q, à) is a 


stochastic variable chosen to satisfy Eq. (5.2.22). The tensor mode contri- 
bution to the temperature fluctuation is then 


ATANT 1 
( To ) =-3 2 


A=+2 


to ; x h 
J Aq ñiñjej O, B.A) J dt BOW Dy iN 
tL 
(7.1.52) 


This is the same as our previously derived result (7.1.18). We will return to 
the tensor term in Cọ in Section 7.3, after we have studied the scalar term 
in the next section. 


7.2 Temperature multipole coefficients: Scalar modes 
We will now apply the results of the previous section to the calculation 


of the contribution C?,., of scalar modes to the multipole coefficients of 
temperature—temperature angular correlations: 


1 
Crre = | [ei Pa (ATA ATAD), (7.2.1) 
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where AT(n) is the stochastic variable giving the departure from the 
mean of the temperature observed in the direction 7, and (...) denotes an 
average over the position of the observer, or equivalently, over the sequence 
of accidents that led to the particular pattern of temperature fluctuations 
we observe. (We are including a label TT to distinguish this multipole coef- 
ficient from those in temperature—polarization or polarization—polarization 
correlations, which are the subject of Section 7.4.) Of course this is not what 
is observed; the observed quantity is 


I 
CH, = re J ai J di! PA AN AT (A) AT) , 


but as shown in Section 2.6, the cosmic variance, the mean square fractional 
difference between this and Eq. (7.2.1), is 2/2 + 1, and therefore may be 
neglected for >> 1. In this section we will consider only the contribution 
Cr. , of scalar modes to Crr,e; as we saw in Section 5.2, tensor and scalar 
modes do not interfere, so we can take up the contribution of tensor modes 
separately in the following section. 

First let’s apply the results that were obtained in the previous section 
by using the kinetic theory approach described in Section 6.1. The use of 
the Boltzmann equation yields formulas (7.1.7)-(7.1.9) for the temperature 
fluctuation. To calculate the coefficients in a partial wave expansion of 
the temperature fluctuation, we use the familiar expansion (2.6.16) of a 
plane wave in Legendre polynomials, together with the addition theorem 
for spherical harmonics: 


ee) £ 
el AnS Y OYO . (7.2.2) 
£=0 m=—£ 


Using this in Eqs. (7.1.7)-(7.1.9), and replacing factors of iĝ -n with deriva- 
tives of the spherical Bessel function je, the scalar contribution to the tem- 
perature fluctuation observed in a direction 7 is given by 


(4 T()) ae YO) (7.2.3) 


£m 


where 
to 
a$ pm = Ari’ Tp J dq a(q) YI”) Í dt 
1 


x [i(ar@) Fad +h(ar0) Ga, d +j (arO) Had] . 
(7.2.4) 
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Here a(q) is the stochastic parameter for the dominant scalar mode, 
normalized to satisfy Eq. (5.2.7): 


(a(q) «*(q’)) = 8q- q’ ; (7.2.5) 


tı is any time sufficiently early before recombination so that a photon present 
then would have scattered many times before the present; r(t) is the radial 
coordinate of a point from which light emitted at time t would reach us at 


the present time to; 
0 dt’ 
N= Baa 7.2.6 
OS aa Ea 


and F (q, t), G(q, t), and H (q, t) are time-dependent form-factors, given by 


ti 
F(q, t) = exp -f : w(t’) ar | 
t 
3 3 
| ang Eas 
-37 (B®) — 77a q(t) 


ld +s wee 
arr EZO +a (HBA) + aaO By | | ARZA) 


t 
G(q, À) = —q w(t) exp |- J “eet” ar | 
t 
x [dupg(t)/a(t) + a(t) By (0)/2] , (7.2.8) 
t 
H(q,t) = Žo) exp |- J i welt") ar | (q, t). (7.2.9) 
t 


As a reminder: @,(t) is the photon collision frequency at time t; A,(t) 
and B,(t) are the scalar fields in the perturbation to the metric in syn- 
chronous gauge, defined in Section 5.2; ®(q, t) and TI (q, t) are the scalar 
source functions, defined by Eq. (6.1.21); and ôupgq(t) is the scalar veloc- 
ity potential of the baryonic plasma. A subscript T has been appended 
to the aem introduced in Section 2.6, to indicate that these are partial 
wave coefficients in the temperature rather than the polarization, and a 
superscript S has been included to distinguish scalar from tensor 
contributions. 

Together with the orthonormality property (2.6.19) of Legendre 
polynomials, Eqs. (7.2.1) and (7.2.3)-(7.2.5) give the scalar multipole 
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coefficients! 
Cc 
, 0 


[-a[i(aro) Fan 


ti(ar) 6.0 +(aro) nao]. 0210 


x 


This formula gives results of high accuracy, but the computer calculations 
used to calculate the multipole coefficients Chr , in this way are not partic- 
ularly revealing. Instead, we will apply a series of approximations that lead 
to a simple analytic formula for the Chr e 

First, we will neglect the integrated Sachs—Wolfe effect, given by the last 
term in Eq. (7.2.7). This effect is important only for relatively small values 
of £, where cosmic variance intrudes on measurements of Cher, i 

Next, we assume a sudden transition from perfect opacity to perfect 
transparency at a definite time tz, and a single dominant mode of 
perturbation. The fractional temperature fluctuation then takes the form 
(7.1.15): 


Gun ) = J Pq a P(E) + ig- àA), (7.2.11) 
To 

which was also derived at the end of Section 7.1 by following photon tra- 
jectories after the time of last scattering. (Here rz = r(tz) is the co-moving 
radius of the surface of last scattering.) For the present we will not use 
formulas (7.1.16) and (7.1.17) for the form factors F (q) and G(q), but will 
proceed for general form factors, returning later to the specific form factors 
(7.1.16) and (7.1.17). 

Using Eq. (7.2.2) in Eq. (7.2.11), and replacing iĝ -ñin the Doppler term 
with 3/3 (qr), we again have the partial-wave expansion (7.2.3), but this 


' The derivatives of the spherical Bessel functions in Eq. (7.2.10) can be expressed as time derivatives, 
and then integrating by parts this can be written at the integral of a single form factor times /¢( qr(t) ). 
The result is equivalent to Eq. (16) of M. Zaldarriaga and U. Seljak, Phys. Rev. D 55, 1830 (1997) 
[astro-ph/9609170], except for a difference of normalization: the source functions I and ® used here 
are 4 times those of Zaldarriaga and Seljak. 

2To correct for whatever inaccuracy is introduced by this approximation, we will later include the 
effect of damping of acoustic oscillations before tz, and average the temperature fluctuation over the 
time of last scattering. 
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time with 
dfin =4ri To | dq aO YO [ilar P+iarr)G@)| - (1.2.12) 


Inserting this in Eq. (7.2.1) and using Eq. (7.2.5) gives the multipole 
coefficients 


i g = 
Core DTF. ` laem? = 16x73 f g° dq 
m=—¢ 
j 2 
x [iar F@ + iar) G@)| (7.2.13) 


This is a standard result, but it does not provide transparent information 
about the dependence of Cre on £. For this purpose, we now make a 
further approximation: we specialize to the most interesting case of large 
£, where cosmic variance can be neglected. In this case, we can use an 
approximate formula for the spherical Bessel functions:? 


cos b cos [vtan — b) — 2/4] /vVsinb p>v 
Jelp) > 
0 p<v, 
(7.2.14) 


where v = £+ 1/2, and cos b = v/p, with 0 < b < 2/2. This approximation 
is valid for |v? — p?| > v4/3. Hence for £ >> 1, this formula can be used 
over most of the ranges of integration in Eq. (7.2.13). Furthermore, for 
p > v > | the phase v(tan b — b) in Eq. (7.2.14) is a very rapidly increasing 
function of p, so the derivative acting on the spherical Bessel function in 
Eq. (7.2.13) can be taken to act chiefly on this phase: 


— cos b v sin b sin [vtan b — b) — 2/4] /v p>v 

Jep) > 

0 p<v, 
(7.2.15) 


(Letting the derivative act on the factor 1/v sin b in Eq. (7.2.14) introduces 
an apparent divergence in the integral at ọ = v, but this divergence is 
spurious; for p very close to v the approximation (7.2.14) breaks down, and 


3 See, e.g., I. S. Gradshteyn & I. M. Ryzhik, Table of Integrals, Series, and Products, translated, 
corrected and enlarged by A. Jeffrey (Academic Press, New York, 1980): formula 8.453.1. 
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there is no singularity.) Using these limits in Eq. (7.2.13) and changing the 
variable of integration from q to b = cos! (v/qr L) gives 


l6n2v 7/2 db 
Chee = j r 


3 2 
ri cosł b 


x lr ( ) cos[v(tan b — b) — 1/4] 
rL cosb 


sind G ( 


2 
) sin[v(tan b — b) — n/41 . (7.2.16) 
rg cosb 


For v > | the functions cos[v(tan b — b) — 7/4] and sin[v(tan b — b) — x /4] 
oscillate very rapidly, so cos?[v (tan b— b) — x /4] and sin? [v (tan b—b)—7/4] 
average to 1/2, while cos[v(tan b — b) — 2/4] sin[v(tan b — b) — 1/4] averages 
to zero. Dropping the distinction between £ and v = £ + 1/2, and changing 
the variable of integration again, from b to 6B = 1/cosb, Eq. (7.2.16) then 
becomes* 
8703 f” d 
e+ ICP, = -5 PRE 


nh yP 


K 
x |r (+) $ a Lg (£) ; (7.2.17) 


We will see that the form factors F (q) and G(q) fall off rapidly for large q, 
in part because IR’? decreases more or less like g~3, so the integral over 
b converges at 6 = oo, and in fact is dominated by small values of 6. The 
integral of the F? term thus receives its greatest contribution from £ ~ 1, 
or in other words, for q ~ £/rL, or g/ar ~ €/d4, where d4 = rraz is the 
angular diameter distance of the surface of last scattering. On the other 
hand, the factor 6? — 1 multiplying G? in Eq. (7.2.17) kills the contribution 
of 6 values very close to unity, so the Doppler term proportional to G? 
makes a relatively small contribution to the multipole coefficients. 

Even without a detailed calculation of the form factors F(q) and G(q), 
we know that they depend on the baryon and total matter densities at 
last scattering, which for a given present microwave background temper- 
ature can be expressed in terms of Qgh? and Qyh’, but since spatial cur- 
vature and dark energy are (presumably) negligible at last scattering, the 
form factors cannot depend on Qg or Qa or Ho. Thus there is a high 


4A more rigorous but rather more complicated derivation was given by S. Weinberg, Phys. Rev. D 
64, 123512 (2001) [astro-ph/0103281]. The contribution of F(q) had earlier been calculated by J. R. 
Bond, “Theory and Observations of the Cosmic Background Radiation,” in Cosmology and Large Scale 
Structure, eds. R. Schaeffer, J. Silk, M. Spiro and J. Zinn-Justin (Elsevier, 1996), Section 5.1.3. 
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degree of degeneracy here; all dependence on Qx or Qa or Ho can only 
enter in the single parameter rz, the Robertson—Walker radius of the sur- 
face of last scattering. If we assume as commonly done that Qg = 0, 
so that Qa, ~ 1 — Qm, then temperature anisotropies can tell us the val- 
ues of Ho as well as Qy and Qs, but they cannot tell us that Qx = 0 
unless we have information about Ho from other source. Likewise, they 
cannot distinguish quintessence from a constant vacuum energy. Further- 
more, ry, and £ appear in €(€ + I) Chr only in the ratio €/rz, so the val- 
ues of Qg, Qa, and Ho can only effect the scale of the €-dependence of 
L(+ DC Pre: For instance, the values of these parameters can affect the 
positions of the peaks in £(€+ 1) Co, , byacommon factor, but cannot affect 
their heights. ' 

We next make the approximation that the gravitational field perturba- 
tions at last scattering are dominated by perturbations in the dark matter 
density. We already remarked in connection with the ISW effect in the previ- 
ous section that in this case Aq vanishes (because each term on the right-hand 
side of Eq. (5.3.38) is time-independent). The field Yq whose evolution we 
followed in Chapter 6 is defined in general as Wy = (34, — q7By)/2, so 
here B, = —2W,/q’, and since Eq. (6.5.16) gives Yg x 1~!/3, we have also 
By = 2W4/3tq?. Also, a « t’, so Eqs. (7.1.16) and (7.1.17) give the form 
factors as 


ol a? (tt) Yaltı) 
F(q) = 3ovq(tL) + 3t (7.2.18) 
G(q) = —qôuyq(tL)/a (tL) + AL) Yq ltL)/4 , (7.2.19) 


in which we have used ôT; /T = dpyg/4Py = byq/3. 

As our next approximation, we shall use the results (6.5.16), (6.5.17), 
and (6.5.18) of our analysis of cosmic evolution in Chapter 6, in which we 
neglected the baryon/dark matter density ratio: 


39° tLROT (K) 


; 7.2.20 
sat (7.2.20) 


Wa (tL) NF 


(0) 


BSR 
SaS roa +3R2) 


=(1 + Rp4 en Soe Tdt S(k) 


x cos (f gal +A) | (7.2.21) 
0 a(t)/3(1 + RW) i. 
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oO 


3R: 
Suyq(tL) = = -tT (x) 


+ == 
V3q(1 + R1)3/4 


xsin ( S 28 + a0) 
0 a(t)/3(1+ RM) 


Here R(t) = 35g(t)/45, (t), R = Ritz), and ay = a(t); T(K), S(), 
and A(x) are the transfer functions defined and calculated in Section 6.5; 
k = V2q/ qEQ, Where qgq is the wave number that comes into the horizon 
at matter—radiation equality; and T (t) is the acoustic damping rate (6.4.25). 
Using Eqs. (7.2.20)-(7.2.22) in Eqs. (7.2.18) and (7.2.19) gives the form 
factors as 


éT E Tdt S(k) 


(7.2.22) 


(0) 


R 
F(q) = Saro Ri 


(1 +R 4 e Jo" Fat Sc) 
tL qdt ) 
xX COS (J PONET O + A(k) ; (7.2.23) 


VIRE 
= a aE 
5(1 + R1)3/4 


x sin (f ga a awo) (1.2.24) 
o0 aOIl + RO) i E 


Note that the “slow” terms have canceled in G(q), and would have canceled 
in F(q) ifit were not for afinite baryon/photon density ratio at last scattering. 

Now we must take up a complication that arises only for the “fast” 
part of the form factors in the case of short wavelengths. We have been 
assuming that the opacity of the universe drops to zero instantaneously at 
a time tz of last scattering, but of course the drop takes place during some 
finite interval of time, over which the form factors must be averaged. This 
makes little difference for the contribution of the slow modes, but for large 
wave numbers the cosines and sines in Eqs. (7.2.23) and (7.2.24) are rapidly 
oscillating functions of tz, so the fast terms can be significantly reduced by 
this averaging. This is similar to what in other contexts is called Landau 
damping, except that usually Landau damping arises from a spread in the 
frequency of an oscillation, while here it is produced by a spread in the 
moment at which the oscillating amplitude is observed. 


Gq) = -h Tt S() 
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To continue with our analytic treatment, since the probability of last 
scattering is a sharply peaked function of time, we can approximate it as a 
Gaussian: the probability that last scattering occurs between t and t + dt 
will be taken in the form 


exp[—(t — t,)?/207] dt 


P(t)dt = 7.2.25 
In Eqs. (7.2.23) and (7.2.24), we must make the replacements 
tL t 
cos wdt+A +00 cos wdt+A 
l (Jo ) ->f P(t)dt {~ (Jo ) , (7.2.26) 
sin ( i wdt + A) —o0 sin ( fj odt + A) 


where w = q/av3(1 + R). For a sharply peaked distribution function P(t), 
we can do these integrals by expanding the arguments of sines and cosines 
to first order in t — tz: 


t tL 
Í wdt~ f wdt + w(t — tL). 
0 0 
The integrals (7.2.26) are now easily done 


m cos ( fj odt + A) 


—0o ee sin T wdt + A) 


cos (i? wdt + A) 


(7.2.27) 
sin (ie odt+ A) 


~ exp(—w7 9; /2) 


Thus, the whole effect of this averaging is to introduce an additional damp- 


ing factor exp(—o7 07 /2) in the fast part of the form factors. Both T and 


2 i 2 ; 
wz, are proportional to q^, so we may write 


tL 
/ T dt + w707/2 = gda}, (7.2.28) 
0 


where dp is a damping length, given by Eqs. (6.4.25) and (7.2.27) as 


dj = dein F di andau > (7.2.29) 
tL t 16 R? 
gap Í y a 7.2.30 
Sik 5 J 6g +R) )15 A+R) hey) 
2 
2 Ot 
2 ee Bai 
dÉ andau 6(1 + Rr) > (7 3 ) 
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with t, = 1/wc the photon mean free time, and R = 3p8/4py,. 
To evaluate the Silk damping term, we recall that R « a, so 
1 R? 


t, = = ; 7.2.32 
Y neore npoRo( — Y)Xorc i ) 


where Ro = 398/492, is the present value of R, Y ~ 0.24 is the fraction of 
nucleons in the form of un-ionized helium around the time of last scattering, 
ngo = BHEO B/87 Gmy is the present number density of baryons, and X (R) 
is the fractional ionization, calculated in Section 2.3. Also, 


ye dR 7 RdR 
RHoy/Qu(Ro/R)3 + QR(Ro/R)4 Hoar Re”, /Reg +R 
(7.2.33) 


where Reo = QRRo/Qy = 3QRQB/4Q YL, is the value of R at matter- 
radiation equality. Putting this all together, the Silk damping length is 
given by 


R2 
dz. = L 
Silk 9/2 
6d — Y)ngoorcHoy 2m Ro 
RL R? dR 16 R? 
x + (7.2.34) 
0 X(R)U+R)/Reqg +R |15 A+R) 


Also, the standard deviation o; in the time of last scattering is related to 
the standard deviation o in the temperature of last scattering, calculated in 
Section 2.3, by o; = 3tLo /2TL, so 


30212 
eee 7.2.35 
Landau 872 (1 + Rr) ( ) 


The form factors (7.2.23) and (7.2.24) may now be written as explicit 
functions of wave number 


(0) 


R 
F(q) = 5h) 3radr iat 


— (1 + Ry)7"/48(qdr Jaz) et 8% cos (adi jap Adrar) |, 
(1.2.36) 
VIRo 


MORTER 


-dpl § (adr Jaz) sin (adujar + A(qdr/ar)) , 
(7.2.37) 
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where dr is a length defined by writing the argument of the transfer functions 
as k = qdr /aL, so that Eq. (6.5.9) gives 


J&R 0017 


dr = _ 
T= OFDM Qu 


Mpc. (7.2.38) 


Also, dy is the acoustic horizon distance at last scattering, given by 


d tL dt 
S =a. | aJ3. + R) 


n 2 -i V1+ Ri +/ReQ + Ri 
~~ Ao /3RLQmu( + zL)? 1+./REQ 


where again Rag = 3QRQB/4QyQ, and Rz = 398/49, (1 + Zz). 


Before using these form factors in Eq. (7.2.17), there is one more com- 
plication that needs to be mentioned. At a redshift Zreion of order 10, 
the neutral hydrogen left over from the time of recombination becomes 
reionized by ultraviolet light from the first generation of massive stars. 
The photons of the cosmic microwave background have a small but non- 
negligible probability 1 — exp(—Teion) (where Trion is the optical depth 
of the reionized plasma) of being scattered by the electrons set free by 
this reionization. The temperature anisotropy AT is then the sum of two 
terms. One arises from photons that are not scattered by the reionized 
hydrogen, and is just equal to the anisotropy we have calculated times the 
probability exp(—Tyeion) of no scattering. The other term arises from scat- 
tered photons, and since this scattering occurs at redshifts much less than 
zy % 1,090, we see the anisotropies at a smaller distance, and hence at 
much lower values of 4. Thus the effect on Cre of scattering by the 


` (7.2.39) 


reionized plasma is simply to multiply Cira by a factor exp(—2Treion), 
except for very small values of £, where in any case cosmic variance inter- 
feres with the interpretation of observations. This means that observa- 
tions of temperature anisotropies alone cannot effectively disentangle the 
reionization probability from the over-all scale of the function R? that 
characterizes primordial fluctuations; they can only tell us the value of 
KAR expP(—2Treion). The measurement of the polarization of microwave 
photons (discussed in Section 7.4) produced by scattering after reionization 
suggests that exp(—2Teion) © 0.8. 

Itis conventional to parameterize the quantity R4 ina form equivalent to 


ng—1 
[ROP = Ng? =) (7.2.40) 
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with ns perhaps varying with wave number. This parameterization is 
convenient, because then 


(£) iy p= ( ep u 
rL Pur’ B3 \krda(l +21) ? 


and it will turn out that ns is not very different from 1. It is only M a og 


that enters in the normalization of IRo, so the choice of kp is arbitrary; it 


is conventional to take it as kz = 0.05 Mpc7!. 


We conclude from all this that for reasonably large values of £ (say, £ > 
20), where we can ignore cosmic variance and the integrated Sachs—Wolfe 
effect, use the large £ approximations that led to Eq. (7.2.17), and treat the 
effect of reionization as a simple factor exp(—2tTyeion), the quantity usually 
quoted as giving the scalar contribution to the multipole coefficients is 


L(+ Ce, g 4r TÈN? e7?reion oo Be ns—1 
a J dp 
27 25 j 


lr 
1 
—__——|137(B0/tr)R 
aaa (Be/€7) Ri 
2 
—(1 + RL)" S (pe/er) eP ©/ cos (Bt/tn i ageen)| 


WP, 


aE -28 PD $280 /07) sin? (Bt/tn F aæejen)| , 


(7.2.41) 
where 
lp =da4/dp, £r=d4/dr, ly =d4/dy, tR=(+2z)krds, 
(7.2.42) 


and again d4 is the angular diameter distance of the surface of last 
scattering: 


1 : d 
d4 Æ realy = -y sinh ay | xX ; 
Qg Hol +22) 1/0421) J Q4x4 + gx + Mx 
(7.2.43) 


with Qr neglected, and Qg = 1 — Qa, — Qy. If we assume as discussed 
earlier that the integral over £ is dominated by values 6 ~ 1, neglect the 
Doppler term, and for the moment neglect the term proportional to the 
transfer function 7 , then Eq. (7.2.41) shows that Chr , has peaks at x (£) = 
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n, 27, 37, etc., where x (£) = €/Ly + A(£/£r) is the phase of the cosine in 
Eq. (7.2.41) for 6 = 1. The presence of the positive term 37 Rz arising from 
the inertia of the baryonic plasma enhances and slightly shifts the peaks for 
X = 7, 37, etc., where the cosine is negative, and reduces the peaks for x = 
2x, 4, etc., where the cosine is positive. For large £ the Silk and Landau 
damping factor exp(—B7¢? /d}) damps out all these peaks. As we will soon 
see, the results of numerical calculations exhibit this expected pattern, of a 
sequence of decreasing peaks, with odd peaks somewhat enhanced over the 
even peaks. 

We can now read off the dependence of the quantities appearing in C$ 2 
on various cosmological parameters. Taking as fixed the well established 
value of the present microwave temperature, (which yields values for Qy h? 
and Qrh*? = Q,h? + Q,h?), and also fixing the values of tz, zL, ne oO, 
which are only weakly dependent on other cosmological parameters, we see 
that 


© Rr X Qph?. 


¢ The integral in Eq. (7.2.34) for d Sik is a complicated but not very 
sensitive function of Qgh? and Qmh?. Aside from the integral, dese is 
proportional to (Qgh?)~7/2(Qyh?)~!/2. 


eL 


Landau 


depends on Qgh? through a factor (1 + Rz) !. 


e 


dr x (Qyh*)7! 


e 


Aside from a slowly varying logarithm, dy « (QBA (Q uh". 


e 


Only d4 depends on Ho, Qa, Qm, QB, or Qg apart from a dependence 
on Qgh? and Qyh*. (For any observationally allowed values of Q, 
or Qx, the effects of a constant vacuum energy or of spatial curvature 
would be quite negligible at times before recombination, so their effect 
on Chr , 1s limited to their influence on the propagation of light after 
recombination, that is, on d4.) 


To see how well the various approximations we have made work in prac- 
tice, we shall calculate Cre for a realistic set of values for cosmological 
parameters taken from a fit to data on the microwave background from 
the CBI, ACBAR, and first-year WMAP observations (about which more 
below). These are the same parameters that have been used in a full-scale 
computer calculation whose results are readily available,° so that we will 


SDN. Spergel et al., Astrophys. J. Suppl. 148, 175 (2003) [astro-ph/0302209]. 
Shttp://lambda.gsfc.nasa.gov/data/map/powspec/wmap lcdm_pl_model_yrl 
v1l.txt 
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conveniently be able to compare Eq. (7.2.41) with an accurate numerical 
calculation. The cosmological parameters of this set are 


Qyh* = 0.13299, Rgh? = 0.02238, h=0.71992, Q,=1-Qy. 


(7.2.44) 
and’ 


ns = 0.95820, kr = 0.05 Mpc™!, N? = 1.736 x 107! , 
e~?treion = (0.80209 . (7.2.45) 


We take To = 2.725K, which yields 2h? = 2.47 x 1075, and we assume 


three flavors of massless neutrinos, which gives Q Rk = 1.6813Q,h7 = 
4.15 x 1075. We will also adopt the parameters describing recombination 
calculated in Section 2.3 (which are not very sensitive to other cosmological 
parameters) for Qgh* = 0.02 and Qyh? = 0.15; in particular, 


1+2z,=1,090, o =262K, tz, = 370,000 yrs. (7.2.46) 
From the values (7.2.44) of Qyh? and Qgh’, we find 
Ro = 679.6, Ry =0.6234, ReQ = 0.2121. (7.2.47) 
Then Eqs. (7.2.38), (7.2.39), (7.2.43) give 
dr =0.1331 Mpc, dy = 0.1351 Mpc, d4 = 12.99 Mpc, (7.2.48) 
while the damping lengths are given by Eqs. (7.2.34), (7.2.35) and (7.2.29): 


dsitk = 0.006555 Mpc, dLandau = 0.004809 Mpc, dp = 0.008130 Mpc . 
(7.2.49) 
Finally, the parameters (7.2.42) appearing in Eq. (7.2.41) are 


£r =97.60, £y =96.15, £p=1598, £r = 708 (7.2.50) 
while the factor multiplying the integral is 


4r Tx N7e~ Aion 


= 519.7 uK? . 7.2.51 
75 u ( ) 


7The parameter N 2 is related to what is often given as an “amplitude” A by 47N 2 = 
20,0002 A/9T,,9(uK)* = 2.95 x 10-9. For instance, see L. Verde et al., Astrophys. J. Suppl. 148, 
195 (2003) [astro-ph/0302218]. It should be noted that, because of different conventions used in writing 
Fourier integrals, the quantity 72, in this paper is equal to what we have defined as Roo y times a factor 


(27)3/2. Reference 6 uses A = 0.73935, corresponding to the value of N2 given in Eq. (7.2.45). 
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Using these values, the result of using Eq. (7.2.41) with the parameter set 
of reference 6 is 


L+ DCR y sga ge N T008 
——* = 519.7 uK dp | == 
ir ux? f aB (75) 


3(0.6234)T (B£/97.6) 


l 
j l =i | 
2 
—~(1.6234)~"/45(80/97.6) e7 2/1598? cos (86/96.15 + A(pe 197.8) | 


3,/p2 —1 


B4(1.6234)3/2 
x sin? (86/96.15 + A(pe/97.6)) | 


e 2(BE/1598)" 5280 /97.6) 


(7.2.52) 


The integral over 8 converges very rapidly, and can be done with a cut-off 
at 6 = 5; raising the cut-off to 6 = 50 has a negligible effect. 

The results are shown in Figure 7.1, in comparison with the more accu- 
rate Boltzmann hierarchy calculation of reference 6, based on Eq. (7.2.10). 
Evidently the hydrodynamic calculation does quite well; like the computer 
calculation of reference 6, it shows a high first peak, followed by two nearly 
equal lower peaks, followed by a decaying tail punctuated by successively 
lower peaks. To give a quantitative comparison, Tables 7.1 and 7.2 show the 


[I+1)C, 
20 
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pS a Eee 
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Figure 7.1: The scalar multipole coefficient (£ + 1)C, a /2x in square microKelvin, vs. £, 
for the cosmological parameters given in Eqs. (7.2.44) and (7.2.45). The hydrodynamic 
approximation (7.2.41) is indicated by the dashed curve, while for comparison the solid 
curve gives the more accurate large scale computer calculation of reference 6, based on 
Eq. (7.2.10). 
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Table 7.1: Comparison of the results for the values of £ at the first five peaks in £(£ + 
OP: ¿/27, as given by Eq. (7.2.52) and as given by reference 6. 


Eq. (7.2.52): 220 541 821 1134 1425 
reference 6: 219 536 813 1127 1425 


Table 7.2: Comparison of the results for the values of £(£ + bere ¿/27 (in uK?) at the 
peaks listed in Table 7.1, as given by Eq. (7.2.52) and as given by reference 6. 


Eq. (7.2.52): 5155 2694 2783 1126 746 
Reference 6: 5591 2525 2451 1221 806 


peak positions and heights calculated using Eq. (7.2.52) and those given by 
the results of reference 6. In all cases, the peak heights given by Eq. (7.2.52) 
agree with the more accurate computer results to within 10%, while the 
results of Eq. (7.2.52) for the peak positions are almost embarrassingly 
good, in no case being off by more than 1%. 

Among other things, the general success of the calculations of this sec- 
tion shows that the evolution of cosmological perturbations is primarily 
hydrodynamic, in the sense that it can be well described by the equations 
of hydrodynamics without the full apparatus of coupled Boltzmann equa- 
tions used in computer calculations like those of reference 6. The Boltz- 
mann equation is implicit in our calculations, because in calculating the 
Silk damping rate we have used standard values for the shear viscosity and 
heat conductivity that were obtained by using the Boltzmann equation for 
photons in an ionized gas, but evidently not much is lost by not solving the 
Boltzmann equation over again in a cosmological context. 

To the extent that the general formula (7.2.41) has been validated by this 
comparison, we can use it to see what can be learned from measurements 
of CTT œ: 


¢ Eq. (7.2.41) shows that the shape of the function £(£ + I) CPr ¢, as for 
instance the set of ratios of the peak positions or of the peak heights, 
depends on only four quantities: dp, dy, dr, and ng. Also, with the 
present radiation temperature and the number of massless neutrinos 
fixed, the three lengths dp, dy, and dr depend only on the two cos- 
mological parameters Qgh? and Qyh?, which can therefore be found 
from the shape of the function £(£ + 1)CPp. te 
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e Measurement of the scale of this function’s -dependence (as for 
instance, the £ value of any one peak) depends on these two parame- 
ters, but also on d4, which is a function not only of Qgh? and Qyh? 
but also of h and Q4. It is therefore impossible, without taking the 
integrated Sachs—Wolfe effect into account, to use measurements of 
Chr , alone to make separate determinations of both A and Q,. It is 
often simply assumed that the universe is spatially flat, in which case 
to a good approximation Q, = 1 — Qm, and then measurements of 
Chr , can be used to determine / as well as Q gi? and Quh?. 


e 


Measurement of the magnitude of the function £(£+1)Crr e for £ > 1 
(as for instance measurement of the height of any one peak) only tells 
us the quantity N? exp(—2Treion), SO as already remarked we cannot use 
it to make a separate determination of N? or Treion- This ambiguity is 
resolved by measurements of the polarization of the cosmic microwave 
background, discussed in Section 7.4. 


The original discovery of microwave background anisotropy was made 
by the COBE collaboration, and is discussed in Section 2.6. This only gave 
information about the anisotropy for relatively small £, well below the posi- 
tion of the first acoustic peak at £ ~ 200. This discovery was followed by 
a series of balloon-borne and ground-based observations, which gave def- 
inite evidence for the first acoustic peak, and some data on higher peaks, 
extending in the case of the CBI collaboration up to values of £ beyond the 
position of the fifth acoustic peak, at £ ~ 1400. The accuracy of these mea- 
surements up to about £ ~ 600 was then greatly improved by observations 
made by a remarkable satellite mission, known as the Wilkinson Microwave 
Anisotropy Probe, or WMAP? The WMAP satellite was launched on June 
30, 2001, made loops around the moon to pick up kinetic energy from the 
Moon’s motion, and finally reached an orbit about the equilibrium point 
known as L2. This point orbits the Sun at the speed needed to keep it about 
1.5 x 10° km from Earth, on the other side of the earth from the sun, a loc- 
ation chosen to isolate the instrument from microwave radiation from the 
sun, earth, or moon. The satellite carries two back-to-back 1.4 x 1.6 meter 


8The collaborations are ARCHEOPS: A. Benoit et al., Astron. Astrophys. 399, L19, L25 (2003) 
[astro-ph/0210305, 0210306]; CDMP & MAT/TOCO: A. Miller et al., Astrophys. J. Suppl. 140, 115 
(2002) [astro-ph/0108030]; BOOMERANG: J. E. Ruhl et al., Astrophys. J. 599, 786 (2003) [astro- 
ph/0212229]; MAXIMA: A. T. Lee et al. Astrophys. J. 561, L1 (2001) ; DASI: N. W. Halverson et al., 
Astrophys. J. 568, 38 (2002) [astro-ph/0104489]; CBI: T. J. Pearson et al., Astrophys. J. 591, 556 (2003) 
[astro-ph/0205388]; ACBAR: C. L. Kuo et al., Astrophys. J. 600, 32 (2004) [astro-ph/0212289]; and 
VSA: C. Dickinson et al., Mon. Not. Roy. Astron. Soc. 353, 732 (2004) [astro-ph/0402498]. 

°C. L. Bennett et al., Astrophys. J. Suppl. 148, 1 (2003) [astro-ph/0302207]. Aspects of this mission 
are treated in detail in other articles in the same volume of Astrophys. J. Suppl. 
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microwave receivers, cooled by thermal radiators to about 90 K, and mea- 
sures the polarization of the microwave background as well as differences 
in its temperature over the whole sky. 

After a year of observation, Cr7¢ had been measured out to £ ~ 600 with 
small errors arising mostly from cosmic variance and foreground emission.!° 
These measurements were fit to the results of the “ACDM” model we have 
been studying, with zero curvature; the contents of the universe supposed to 
consist of photons, three flavors of massless neutrinos, baryonic matter, cold 
dark matter, and a constant vacuum energy; and a primordial spectrum of 
purely adiabatic fluctuations given by Eq. (7.2.40), with ns constant. The 
values of cosmological parameters derived from this fit were:!! 


* gh? = 0.024 + 0.001 

e Qyl? = 0.14 + 0.02 

e h= 0.72 +0.05 

e |N]? = (2.1 40.2) x 107! 


e ns = 0.99 + 0.04 


BYS +0.076 
* Treion = 0.16619.976 


(Measurements of polarization were used here chiefly to measure the optical 
depth Treion of the reionized plasma,!* which is needed to obtain |N]? from 
the value of |N |? exp(—2tyeion) given by the measured values of Ce. e) 

In March 2006 the WMAP group announced the results of the second 
and third years of observation.!? The results are shown together with a best 
fit of L(£ + I)CPp ¿/27 to the ACDM model (with zero tensor anisotropies) 


in Figure 7.2. This fit gave the parameters:!4 
* Qgh? = 0.02229 + 0.00073 
© Qh? = 0127775 0085 
0.031 
e h = 0.732+0032 


e IN|? = (1.93 + 0.12) x 10710 


10G. Hinshaw et al., Astrophys. J. 148, 135 (2003) [astro-ph/0302217]. 

DN. Spergel et al., ref. 5. Errors given here represent a 68% confidence range. 

12A. Kogut et al., Astrophys. J. 148, 161 (2003) [astro-ph/0302213]. 

13Temperature and polarization results are given by G. Hinshaw et al., Astrophys. J. Suppl. Ser. 
170, 288 (2007) [astro-ph/0603451] and L. Page et al., Astrophys. J. Suppl. Ser. 170, 335 (2007) [astro- 
ph/0603450], respectively. 

14D, N. Spergel et al., Astrophys. J. Suppl. Ser. 170, 377 (2007) [astro-ph/0603449]. 
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Figure 7.2: Comparison of theory and WMAP observations for the multipole coefficient 
£(€+ 1)Ce/2z in square microKelvin vs. £, from D. N. Spergel et al., astro-ph/0603449. The 
solid data points are the three-year WMAP data, and the light gray data points are the first 
year WMAP data. The bottom curve is the best fit to the three-year WMAP data. The top 
curve is the best fit to the first-year WMAP data, and the middle curve is the best fit to the 
first-year WMAP data combined with data from CBI and ACBAR. 


* ng = 0.958 + 0.016 
© Treion = 0.089 + 0.030 


The new results are consistent with those found before, but evidently 
there has been a significant improvement in the precision with which these 
parameters are known. This increased precision has now revealed the 
important information that ns is very likely somewhat less than unity, as 
expected on the basis of typical inflationary theories, to be discussed in 
Chapter 10. It is also important that the optical depth of the reionized 
plasma found by polarization measurements is considerably less than pre- 
viously found, suggesting a more plausible time of reionization.!> Because 
the magnitude of Chr, ¿ 1s proportional to |N |? exp(—2tyeion). the reduction 


iN Treion has led to a corresponding reduction in the estimated value of |N|*; 
there has been very little change in the reported value of |N]? exp(—2tyeion)- 


'SFor the implications of the three-year WMAP measurement of the plasma optical depth 
for theories of early star formation, see M. A. Alvarez, P. R. Shapiro, K. Ahn, and 
I. T. Iliev, Astrophys. J. 644, L101 (2006) [astro-ph/0604447]; Z. Haiman and G. L. Bryan, Astro- 
phys. J. 650, 7 (2006) [astro-ph/0603541]; T. R. Choudhury and A. Ferrara, Mon. Not. Roy. Astron. 
Soc. 371, L55 (2006) [astro-ph/0603617]; M. Shull and A. Venkatesan, astro-ph/0702323. 
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It is truly satisfying to see that the values of h, Q mh?, and Qgh? obtained 
from the cosmic microwave background anisotropies confirm the values 
obtained by the very different methods discussed in Chapters | and 3. The 
values of |N|? and ns are also in good agreement with values inferred from 
observations of large scale structure, discussed in the next chapter. The 
general agreement of theory and observation, both for the microwave back- 
ground anisotropies alone and for the microwave background anisotropies 
in conjunction with other observations, goes far to confirm the general 
assumptions of the cosmological model, including cold dark matter and 
dark energy, that we have been using. 


7.3 Temperature multipole coefficients: Tensor modes 
We next consider the contribution of cosmological gravitational radiation to 
the temperature multipole coefficients.! According to Eq. (7.1.5), in the case 


of a single dominant tensor mode the tensor contribution to the fractional 
temperature fluctuation is 


ATAN 1 
( 2) = -53 Ð | Ba 6.9 ueu) 


T 
0 Aad? 


to to dt’ 
a) dt exp aaf ral d(q,t), (7.3.1) 
t t a(t’) 


where for brevity we have now introduced the quantity 


t 
d(q, t) = exp -f t wet) (Bun = sel, D) . (7.3.2) 
t 


As a reminder: £(q, A) is the stochastic parameter for the mode that does 
not decay outside the horizon, which is assumed to dominate the tensor 
perturbation; ex (ĝ, à) is the polarization tensor defined in Section 5.2 for 
a gravitational wave with wave number q and helicity 4; tı is any time suf- 
ficiently early before recombination so that any photon present then would 
have been scattered many times before the present; we (t) is the photon colli- 
sion rate at time t; D4(¢) is the gravitational wave amplitude, defined in Sec- 
tion 5.2; and Y (q, t) is the tensor source function, defined by Eqs. (6.6.11) 
and (6.6.12). 


lFor an early calculation of the first few multipole moments, see V. A. Rubakov, 
M. V. Sazhin, and A. V. Veryaskin, Phys. Lett. 115B, 189 (1982). A general formula was given by 
R. Fabbri and M. D. Pollock, Phys. Lett. 125B, 445 (1983). 
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Again using Eq. (7.2.2) to expand the exponential in spherical harmonics, 
the tensor temperature fluctuation has the expansion 


A(T) ae 
(AT) = aF eml Ô), (7.3.3) 
lm 
where 
ab ¢m=To >) J dq PA, X) fom.) » (7.3.4) 
ea 


fina = -27 | dP YP" Anh eG EYE DYE 
LM 


to 
x J dt j(qrO©) dq. (1.3.5) 
ti 


and r(t) is again the radial coordinate of a point from which light emitted 
at time t would reach us at the present: 


to dt! 
o= f Fok (7.3.6) 


It will be very convenient first to calculate fem(q, à) for q in the three- 
direction Z. In this case we have a (q) = mov 2L + 1/47. Also, using 
Eq. (5.2.15), for ñ = (sin@ cos ¢, sin@ sing, cos @) we have 


1 : 
fishy ey, 42) = -5 sin? 9 e+% = 4 [= YE Â) . (7.3.7) 


The integral over fi is given by a special case of the general formula” 


(2A +1)(2€ +1) 
4n(2L + 1) 


J di YE @ YE Y~ A) = / 


x Cea (L, M;m, =u) Cea (L, 0; 0, 0) > (7.3.8) 


where Cea(L, M;m, u) is the usual Clebsch-Gordan coefficient for 
combining angular momentum quantum numbers £, m and A, u to form 


2See, e.g., J. M. Blatt and V. F. Weisskopf, Theoretical Nuclear Physics (John Wiley & Sons, New 
York, 1952): Appendix A, Eq. (5.11). 
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angular momentum quantum numbers L, M. Hence 


2+ 1 
fom qs #2) = ~2f TEO D E CoL, 05 42, #2) Carl, 00,0) 
L 


to 
we J dt i(ar®) d(q, t). (1.3.9) 
ti 


In our case we have A = 2, u = +2, and M = 0, so the relevant Clebsch- 
Gordan coefficients are 


3(€ + 2)(€+ 1) 
2(2€ + 1)(20 + 3) 


Ce2(€ + 2,0; 0, 0) -j 


H g= 
Cor(t + 2,0; +2, F2) = 
A ie Vases 


e+ 1) 
(2@ — 1)(2€ + 3) 


Ce2 (£, 0; 0,0) = -| 


36 = 1) G42) 


Cro(€, 0: +2, F2 
el F) =A- D4 


3€(€ — 1) 


Cor(l — 2, 0:0, 0 
call — 2, = VI- DD 


+ 
-l 


Ce2(€ — 2,0; +2, 2) = 


1 / @+DE+2) 
2V Ql- DQL++1)’ 


while Cz2(€ + 1,0;0,0) = 0. Putting this together shows that for q in the 
three-direction, the non-vanishing values of the quantity (7.3.5) are? 


A ; 2€+ 1)(€+2)! 
fim(q,#2) = “| ima f at dant 


| jex2(gr) (ar) ie-2( ar) | 


FDE | -DEFI QE- DFD 
(7.3.10) 


3This is essentially the result originally obtained by L.F. Abbott and M. B. Wise, Nucl. Phys. B 244, 
541 (1984), and A. A. Starobinsky, Sov. Astron. Lett. 11(3), 133 (1985), generalized to an arbitrary 
gravitational wave amplitude D,(t) and including the correction proportional to W in Eq. (7.3.2). 
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This can be greatly simplified by iterating the familiar recursion relation 


Je(e)/p = Ge- (0) + Jeri (L+ 1), 
which gives 


Jelo) Je-2(p) 2je(p) Je+2(p) 
= oh N o (7.3.11) 
p (2€4+1)2@-—1) (2€4+3)2@-1) (2€41)2¢+4+3) 


Using this in Eq. (7.3.10), we have 


A 224+ 1)e 2)! Jelar@ 
fang 32) = f” nt if dt diq, d) a 
(7.3.12) 


The amplitude (7.3.5) can now be found for a general direction of q 
by performing a standard rotation S(q) that takes the three-axis into the 
direction g. (An explicit formula for S(qg) will be given in the next section; 
it is not needed here.) This gives 


Fem) = Y DEn (S@) fem (42,0) - (7.3.13) 
+ 


The coefficients (7.3.4) are then 


2€+1)@+2 X 
g TEDULO S 2) dq Bq, £2) D\)..(S@) 


2(€ — 2)! 
we d(q,t nee) 7.3.14 


We can now easily calculate the multipole coefficients of the temperature 
correlation function, defined by 


(AF em arm) = e'mm Cire . (7.3.15) 


For this purpose, we recall that the stochastic parameter £ (q, A) is normal- 
ized so that 

(Bq. X) 2*0, X) = ðw a- q’) . (7.3.16) 
In the calculation of Crr, ¿ we encounter an angular integral 


[ea DP (s@)D @.2(s@) 
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This matrix can readily be seen to commute with all rotation matrices, so 
that it must be proportional to d,.75¢¢ with an m-independent coefficient. 
To calculate this coefficient, we can set m = m’ and £ = £’ and sum m from 


—£ to +£; using the unitarity of the matrix De, (s (), this must equal 
f d? = 47, so 
27 pO n \ pe) KANE 4n 
f Pa RASO) RSD) = mde 03I 
Using this and Eqs. (7.3.14) and (7.3.16) in Eq. (7.3.15), we see that the sum 
over helicities in Eq. (7.3.14) now just yields a factor 2, sof 


2 
4n?(€+2)IT? (° dq) f” Je (a0) 
T 0 

= —a | .3.1 
Crrye GD j 2 [ dt d(q, t) AO (7.3.18) 


(This formula can be obtained more easily by a direct calculation of the 
temperature correlation function, without calculating the af, tm Dut we will 


need the ar ¢m in the next section to find the correlation between polarization 
and temperature anisotropies.) 

It remains to calculate D,(t) and the source function W(q,t), As we 
saw in Section 6.6, if the gravitational wave amplitude D,(f) is written as 
a function of K = qV2 /HEQdEQ and y = a(t)/dgq instead of q and t (the 
subscript EQ referring to the time of radiation—matter equality), then aside 
from an over-all factor D3 (equal to the value of D4 (t) outside the horizon), 
the amplitude D,(t) is independent of any other cosmological parameters. 
We write this as 


Dy(t) = D? Dk, y) . (7.3.19) 


For a spectrum of gravitational waves that is scale invariant outside the 
horizon, the quantity q? PAR isa constant. To take account of more general 
possibilities, it is conventional to write it as proportional to a power nr of 
q, or equivalently, 
31m0;2 2 (alao N" 
qD = NF (=) (7.3.20) 
D 

where kp is an arbitrary wave number, often taken as kp = 0.002 Mpc™!, 
and N7 is a constant analogous to the constant N that describes the strength 


4This result is derived in a different way by M. Zaldarriaga and U. Seljak, Phys. Rev. D 55, 1830 
(1997) [astro-ph/9609170]. Their result is the same as Eq. (7.3.18), if we take their undefined gravitational 
wave amplitude h to be Dg(t)/2, with their stochastic parameter normalized so that P} = 1. 
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of the adiabatic scalar mode. The conventional ratio r of tensor and scalar 
modes is r = 4|N7|*/|N|?. Also, until vacuum energy becomes important, 
we have dt = /2y dy/HgQ./1 + y, so we can write 


dt dy 
2na =2k/1 Fy. 
af x È Jlt+y i 


The wave equation (6.6.58) then gives 


2D 1 2d D 
Ga n (oS +) SEMEN) Se Dee 5) 
dy 2 y dy 


24f, f? aD(x,y’) ., 
ae zf K(2« T+ y VRA) A y, (1.3.21) 
y Jo dy 


where f, = py /Ar = 0.4052 and K (v) = j2(v)/v, with the initial condition 
D(x,0) = 1. Having found D(x, y) in this way, the source function Y (q, t) 
is calculated in computer programs such as CMBfast and CAMB by using 
Eq. (6.6.21), with the amplitudes A and A found by a numerical sol- 
ution of the coupled Boltzmann equations truncated at some maximum 
value of £. There is an easier alternative procedure, based on the integral 
equation (6.6.27):5 


t t 
W(q,t) = ; J dt' exp |- J welt”) a 
ti t 


. , t dt" : t dt” : 
« [ab (af aay) tee" (af aay) #9), 
(7.3.22) 


where F (w) = jow) — 2j1(v)/v + 2j2(v)/v2. We can also put the differential 
optical depth in the form 


dt(y) = @ dt = neorc dt = —_—— 

y/l+y 
where X (y) is the fractional hydrogen ionization calculated in Section 2.3, 
which depends on Qgh? and Q mh? as well as y, and A is the dimensionless 
constant 


X(y) dy, 


325 Qu He 
A = 0.762 ET = 1.9646 x 105 (Qh?) (Qh?) . 
82 Gimp QR 


5S. Weinberg, Phys. Rey. D 74, 063517 (2006) [astro-ph/0607076]; D. Baskaran, 
L. P. Grishchuk, and A. G. Polnarev, Phys. Rev. D 74, 083005 (2006) [gr-qc/0605100)]. 
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(It is assumed that the baryons are 76% hydrogen, with the rest un-ionized 
in the era of interest.) It follows that the source function takes the form 


Y(q, t) = Dy SK, y) (7.3.23) 


where S depends only on Q mh? and Qgh? as well as « and y, and satisfies 
the integral equation 


3P, Vay) 25 
se» =; | dy ap|- g ay] 


y ; dy" 


dD(«, y’) - 
x | -226 k (xi yTFr- TF7) 


dt(’) 
dy’ 


+ F (21/1 +y- yl +y'l) sey) . (7.3.24) 

The multipole coefficient calculated from Eqs. (7.3.18)-(7.3.24) is shown 
by the solid curve in Figure 7.3. The dashed line in Figure 7.3 shows the 
result of simplifying the expression for the multipole coefficients in the case 
£ >> 1, using the approximate formula (7.2.14) for the spherical Bessel 


Il+1) CF; 


l 


200 400 600 800 1,000 


Figure 7.3: The tensor temperature-temperature multipole coefficient €(€ + 1)C Tr ¿/27 in 
square microKelvin, vs. £, from ref. 6. The solid line is the result of using the essentially 
exact formula (7.3.18); the dashed line gives the result of the large-€ approximation (7.3.25). 
The cosmological parameters used in these calculations are described in the text. 
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function in Eq. (7.3.18), which gives® 


[0.0] 
+ IC, > arra f g° dq 
0 


2 
3 
x J dt d(q, t) Fee u cos [acannon] 
r(t)>l/q vsin b cos b=£/gri(t) 
(7.3.25) 


The source function (gq, t) is calculated in both cases using the integral 
equation (7.3.22), instead of using the equivalent formula (6.6.21) together 
with the truncated Boltzmann hierarchy (6.6.18), (6.6.19). Both of these 
curves are calculated with a cosmological model that in most respects is 
consistent with current observations: We assume Qg = 0, and constant 
vacuum energy, with Qg = 0.0432, Qm = 0.257, Qa = 0.743, h = 0.72, 
and Tp = 2.725K. In calculating the photon collision frequency, we use 
the Recfast recombination code,’ with helium abundance Y = 0.24. The 
parameters in Eq. (7.3.20) are taken as NZ. = 4.68 x 1071! (corresponding 
approximately to the WMAP3 value of the scalar amplitude N quoted in 
the previous section, andr = 1) and; = 0. Reionization is ignored in this 
calculation. To take into account a different value of r or a finite optical 
depth t of the reionized plasma, for £ > 10 it is only necessary to multiply 
the multipole coefficients shown in Figure 7.3 by r exp(—2r). 

(In the approximation of a sharp transition at time tz from thermal equ- 
librium, in which w,(t) >> H(t) and W(qg,t) = 0, to perfect transparency, 
in which w(t) = 0, Eq. (7.3.25) becomes 


[0,0] 
eé+1Cch.y > arr f qdq 
, 0 


cos? 


. s’ b 
x dt D(t) $ —— cos | £(tanb — b) — 17/4 
ae i = | f ] 


cos b=t/qr(t) 
(7.3.26) 


Because of the spread of values of tan b — b, we cannot here make the sort 
of further simplification that we made for scalar temperature fluctuations 
in going from Eq. (7.2.16) to Eq. (7.2.17).) 


6R. Flauger and S. Weinberg, Phys. Rev. D 75, 123505 (2007) [astro-ph/0703179]. 
7S, Seager, D. D. Sasselov, and D. Scott, Astrophys. J. 523, L1 (1999) [astro-ph/9909275]; Astrophys. 
J. Suppl. 128, 407 (2000) [astro-ph/9912182]. 
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Figure 7.3 shows that Chr. , has an ¢-dependence very different from that 
of the scalar contribution calculated in the previous section, so the agree- 
ment between existing observations and theory including only scalar per- 
turbations sets an upper bound on the strength of the tensor perturbations. 
Under the assumption (as suggested by theories of inflation discussed in 
Chapter 10) that Dj and R? have about the same q-dependence, the WMAP 


collaboration has concluded? after three-years of operation that r < 0.55, 
at a 95% confidence level, where r = 4|D9|?/|R}|. This limit arises chiefly 
from temperature rather than polarization measurements, because of the 
much greater signal to noise ratio of the temperature measurements; the 
limit on r from polarization measurements alone is r < 2.2. 

Strictly speaking, the limit r < 0.55 does rely on polarization mea- 
surements, which are used to determine the optical thickness of the reionized 
plasma. This is needed to determine the value of the slope parameter ng for 
scalar modes, which in turn is needed in subtracting the scalar contribution 
to Crr from the observed values in order to set a limit on the tensor 
contribution. But cosmic variance sets a limit to the accuracy with which 
this subtraction can be made, and in the long run the best upper limits or 
the actual detection of tensor modes will come directly from polarization 
measurements, to which we now turn. 


7.4 Polarization 


Observations of the cosmic microwave background reveal not only its inten- 
sity in various directions, but also its polarization. The microwave back- 
ground is expected to be polarized because of its scattering by free 
electrons,! such as were present around the time of recombination, or during 
the later period of reionization due to ultraviolet light from the first gen- 
eration of stars. Polarization measurements have become of importance in 
learning when reionization began, and in disentangling the effects of reion- 
ization from the primordial intensity of fluctuations, and they may become 
even more important in future, in revealing the effects of gravitational waves 


8L, Page et al., Astrophys. J. Suppl. Ser. 170, 335 (2007) [astro-ph/0603450]. 

IM. J. Rees, Astrophys. J. 153, L1 (1968). Polarization correlations at small angu- 
lar separation were considered in early papers on the polarization of the cosmic microwave 
background; see e.g. A. G. Polnarev, Sov. Astron. 29, 607 (1985); R. Crittenden, 
R. L. Davis, and P. J. Steinhardt, Astrophys. J. Lett. 417, L13 (1993); D. Coulson, 
R. Crittenden, and N. Turok, Phys. Rev. Lett. 73, 2390 (1994); A. Kosowsky, Ann. Phys. (N.Y.) 
246, 49 (1996). Then all-sky analyses were given by U. Seljak and M. Zaldarriaga, Phys. Rev. Lett. 
78, 2054 (1997) [astro-ph/9609169]; M. Zaldarriaga and U. Seljak, Phys. Rev. D 55, 1830 (1997) [astro- 
ph/9609170], and by M. Kamionkowski, A. Kosowsky, and A. Stebbins, Phys. Rev. Lett. 78, 2058 
(1997) [astro-ph/9609132]; Phys. Rev. D. 55, 7368 (1997) [astro-ph/9611125]. 
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produced during inflation. 

In Section 6.1 we introduced a dimensionless photon intensity perturba- 
tion matrix Jj(x, p, t), defined by Eq. (6.1.13). It is Hermitian, transverse 
in the sense that l 

D Ji (X, P, t) =p Jij(X, p, th=0, (7.4.1) 
and, as already noted in Section 7.1, its trace at x = 0 and t = fo is related 
to the fractional photon temperature fluctuation seen from earth at present 
in any direction fi by 


ATA) 1 E 
= —J;(0, — ; 4.2 
To qi (0, n, to) (7 ) 


For ñ = —p in the three-direction 2, any matrix with these properties can be 
put in the form 


ATA +0 UG@)-iWE) 0 
Ji(0,—2, 10) = 7 UG)+iVEG) ATE) —-O@ 0|, (7.43) 
0 0 0 


where Q, U, and V are three real functions of direction with the dimen- 
sions of temperature, known as Stokes parameters.” In this case the Stokes 
parameters can be expressed in terms of the matrix Jj, as 


QC) iU) = ex i) ex jE) Jy, -2, to) , (7.4.4) 


T 
2 
To 


VE) = 4 


e- 12) e+ ;@ (IO, -2, t0) — Ja0, -2,40)) , (7.4.5) 
where e+ (ĉ) = (1, +i, 0)/./2 are the polarization vectors for a photon com- 


ing from the three-direction. Accordingly, for a photon coming from an 
arbitrary direction 7, we define the Stokes parameters as 


T 
OÂ) +iU@) = ses (A) ex Â) Jj, —A, to) , (7.4.6) 
T, 
V) = ae iA) e+ (A) (7500, —h, to) — Jii(O, —A, to)) , (7.4.7) 


where e+ (ñ) are the polarization vectors for a photon coming from the 
n-direction. Writing 7 in terms of polar and azimuthal angles 6 and ¢ as 


ñ = (sin@ cos@, sind sing, cos@) , (7.4.8) 


2The Stokes parameters are sometimes defined with extra constant factors to give them the dimen- 
sions of intensity rather than temperature. 
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we can take 
e1) = (Ô +id)/V2, (7.4.9) 


where 6 and ¢ are orthogonal unit vectors in the plane perpendicular to ñ, 
in the directions of increasing 6 and ø, respectively: 


6 = (cos0 cosġ, cos@sing, — sing) , d = (—sing, cos¢,0). 
(7.4.10) 


It is the Stokes parameters that are measured in observations of the 
microwave background. The scattering of light by non-relativistic elec- 
trons does not produce circular polarization, and therefore we expect that 
all microwave background photons are linearly polarized, so that Jj is 
real, and therefore V = 0. In this case the fractional intensity pertur- 
bation e'e J; reaches a maximum value 2(AT + y U? + Q?)/Tp for a real 
polarization vector e in a direction 6 cosé + ĝ sin £, where tan 2£ = U/ Q. 
For a polarization vector in the orthogonal direction, e'e’ Jij takes its mini- 
mum value 2(AT — y U2 + Q?)/To, giving a total fractional intensity per- 
Mirbauon 4AT/To, as is necessary since the photon energy density goes 
as T". 

Now we have to face a complication: As we have defined them, the Stokes 
parameters are not rotational scalars. That is, under an arbitrary rotation 
xi > x} = Rjx; of the three-dimensional coordinate system, we do not have 
O'A) = OA) or U'A) = Un). This is because the polarization vectors 
are not really three-vectors. We note that 


e+ ;(7) = Si (Het; Ê) 5 (7.4.11) 


where S;(7) is a standard rotation that takes the three-axis into the direction 
ñ. For n = (sin 0 cos@, sin @ sin d, cos 0): 


cos cos@ —sing sin cosd 
Sj(n) = | cos@ sing cosd@ sind sing] , (7.4.12) 
— sind 0 cos 0 


and again e4 (ĉ) = (1, +i,0)//2. For an arbitrary rotation R, we can write 


ex (RÂ) = Sj(Riexj(2) = |R sia] [OR SRA] es. j(2) 


Now, S(Rf) takes the three-axis into the direction Rì, RT! takes this into 
the direction ñ, and then S7! (ñ) takes this back to the direction of the three- 
axis, so S~!(#)R~!S(Ra) leaves the three-direction invariant, and must 
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therefore be a rotation by some angle w(R, ñ) around the three-direction. 
Acting on e4 (ĉ), this yields a factor e*'”, and so 


e4 (RÂ) = +Y RA Ries (A). (7.4.13) 


The matrix Jj; is an ordinary three-tensor, in the sense that an arbitrary 
rotation R takes Jj; into Jij with 


Jy (RÂ) = RiRiJu@ , 


Because RTR = 1, a rotation R subjects the Stokes parameters (7.4.6) to 
the transformation Q > Q', U > U’, with 


O'A) EiUA) = e RA (OM) HiU]. (7.4.14) 


For this reason, if we were to expand the Stokes parameters in a series 
of ordinary spherical harmonics, as we do for scalars like the tempera- 
ture fluctuation, then the expansion coefficients would not transform under 
rotations according to the usual representations of the rotation group. 
Instead, we expand the Stokes parameters O(n) and U (ñ) seen in a direc- 
tion f in a series of functions? VP (A) with the same dependence on the 
polarization vectors as the Stokes parameters themselves: 


love) rA 
OA HUA =Y Y aP emV A , (7.4.15) 
l=2 m=—£ 
VW (ni) =2 ce ey (fn) e (A) ViV; Ye") , (7.4.16) 
f MEDL T H t G 


where V is the angular part of the gradient operator: 


ee ee eee: 
e Q 


=0 . 7.4.17 
30 ji sind 0d ( ) 


The subscript “P” (for “polarization” on the coefficients ap em is introduced 
here to distinguish them from the coefficients entering in the expansion 
(2.6.1) of the temperature fluctuation, which in this chapter are denoted 
atm. Under a rotation R, the ordinary spherical harmonics transform as 

RD => DRI O (7.4.18) 


mm 


m 


3These area special case of functions introduced in a study of gravitational radiation by E. T. Newman 
and R. Penrose, J. Math. Phys. 7, 863 (1966). In their notation, which is used in some recent papers on 
microwave background polarization, the function Yj" is denoted 2 Y¢ m. 


373 


7  Anisotropies in the Microwave Sky 


where D® (R) are the unitary irreducible matrices* of rank 2£ + 1 that form 
a representation of the rotation group, in the sense that D (R1) D® (R2) = 
D©(R,R2). It follows then from Eqs. (7.4.13), (7.4.16) and (7.4.18) that 


VIRA) = e4 RA Y DO (RDV G), (7.4.19) 
m 
and hence also 
VRS ERASED AR VT a) (7.4.20) 
m 


For this reason, Y; and its complex conjugate are known as spherical 
harmonics of spin 2 and —2, respectively. Just as for ordinary spherical har- 
monics, it follows from these transformation properties that f dQ Yj" rym 
vanishes except for £ = ¢’ and m = m’, and in that case is independent 
of m. The factor 2,./(€ — 2)!/(€ + 2)! is inserted in Eq. (7.4.16) to make 
these functions satisfy the orthonormality condition 


J dQ VEY = stw ômm 5 (7.4.21) 


just like the ordinary spherical harmonics Y;". Direct evaluation of 
Eq. (7.4.16) gives’ 


PORN ES 
Ye O= EF 


ð ay ð ð 
(Z +iesco) - cot (7+ iese0) Y (a) 


imp |€- 2)! ¢ —|ml)! 26+ 1 
(C+ 2)!(€+|ml)! 4r 


go +u)(m-— Ep) Piu) 
1 — p2 


| -e+ DP u) 


4See, eg., A. R. Edmonds, Angular Momentum in Quantum Mechanics, (Princeton 
University Press, Princeton, 1957): Chapter 4; M. E. Rose, Elementary Theory of 
Angular Momentum (John Wiley & Sons, New York, 1957): Chapter IV; L. D. Landau and E. M. 
Lifshitz, Quantum Mechanics — Non Relativistic Theory, 3rd edn. (Pergamon Press, Oxford, 1977): Sec- 
tion 58; Wu-Ki Tung, Group Theory in Physics (World Scientific, Singapore, 1985): Sections 7.3 and 8.1. 
Note that the rotation matrices we use are appropriate for the representation furnished by the spherical 
harmonics we use, with phases such that YP’ @)* = Yp MA). The corresponding angular momen- 


tum matrices J are not the usual ones, for which the elements of ie $ ad are real and positive. 


If we changed the phase of the spherical harmonics so that Y7"(f)* = (-1)""Y,- mM), then the rota- 
tion matrices would be generated by conventional angular momentum matrices, with the elements of 


J 9 + a? real and positive, but then we would have to introduce phases into the definition (7.4.25) of 
the E and B-type partial wave amplitudes, different from those used by Zaldarriaga and Seljak in ref. 1. 

>The final expression is taken from Zaldarriaga and Seljak, ref. 1, while the first expression, which is 
derived here from our definition (7.4.16), is their definition of the spin +2 weighted spherical harmonic. 
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2(e m 
J ( me p w)| 


(€+2)'"@—2)! 4x 


pe Veen) 


(ST cor 672); (7.4.22) 


1/2 
= eimo Caries a Dies J sin?"(0/2)(—1) "0/2 


where u = cos@ and P/"(1) are the usual associated Legendre functions. 
(Here we will only need the first expression in Eq. (7.4.22).) Unlike the ordi- 
nary spherical harmonics, the Y;” do not satisfy any simple reality condition. 
Rather, the complex conjugate of Eq. (7.4.15) gives 


ee) £ 
Of) -—iU AM => Y Apm A, (7.4.23) 
£=2 m=—£ 
where 
mays — 2 [= ?)! jr AVIV) Y” À 7.4.24 
Ye A)“ = agp i e_;(n) Vi Vj e (n) , (7.4.24) 


in which we have used the reality property of the spherical harmonics 
employed here, ¥j"* = Y; ™”. 

Because VY; does not satisfy any simple reality condition, neither does 
apm. Instead, we can define coefficients 


AE, £m = - (apem +ape aa /2 » ABem=i (aP,em — abe s /2 ; 
(1.4.25) 


This is a useful decomposition, because of the properties of the coeffi- 
cients under space inversion. If we reverse all three coordinate axes, then 
6 +> x — 0, while p > ẹ +x, so Ô > 6 and ¢ > —¢. It follows then 
that the polarization vectors (7.4.9) of definite helicity are interchanged 
under space inversion; that is, e+ (—ñ) = e+ (ñ), and therefore according to 
Eq. (7.4.6), under space inversion O(n) > Q(—n) while U (ñ) — —U(—n) 
and V (ñ) —> —V(—n). (The reader will later be able to check these space- 
inversion properties of Q and U, by noting that these are the changes 
that are produced if replace the stochastic parameters a(q) and £ (q, A) in 
Eqs. (7.4.31) and (7.4.40) with their space-inversion transforms a(—q) and 
B(—q, —A), respectively.) Thus by applying a space inversion to Eq. (7.4.15), 
we find that space inversion takes the partial wave amplitudes ap em into 
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/ 
aP em where 


Q(—â) — iU (—ñ) 


lo) £ 
= 2) Da Oih 


@ — 2)! 


GDI ei ® e+ ® Vi Vj Yi" (A) 


L=2 €=0 m=—£ 
lee) £ £ 2)! SES 
= 2(-1)° dX do d'p om Gad e— j(—fi) e_ ;(—fA) Vi Vi Y7"(—A) 
love) £ 
= (190 $ apend” A" , 
L=2 m=—£ 


the factor (—1) coming from the reflection property Y" (—ìà) > (=1)" 
Y;"(n). Comparing this with Eq. (7.4.23), we see that space inversion 
changes apm into dP em = OT, _m SO it changes ag, em (and also 
the corresponding temperature multipole coefficient arem) by a sign (—1)£, 
while it changes ag,em by a sign —(—1)°. 

These coefficients are stochastic variables, governed by a probability dis- 
tribution that is presumably invariant under space inversion, so there can 
be no bilinear correlation between polarization fluctuations of B and E- 
type, or between temperature fluctuations and polarization fluctuations of 
B type, though there can be correlations between temperature fluctuations 
and polarization fluctuations of E type. Taking into account also the rota- 
tional invariance of the probability distribution, the only non-vanishing 
bilinear averages are of the form® 


(aF ¢m4T,¢ m') = Crr,e8e,¢ Sm! » (7.4.26) 
(aF ¢m@E,e m) = CTE, € 50,0 mm » (1.4.27) 
(dE ¢m4E, Um) = CEE, ¢ 50,0 mm » (7.4.28) 
(23 em 4B, em!) = CBB, € 50,0 mm - (7.4.29) 


All Cyy,¢ coefficients have the same dimensions, of square temperature. By 
their definition, the a7, ag, and az all satisfy the reality conditions 


* * * 
AT em = aT k—m >» AE em =dEfL—m , AB em = 4B, —m » (7.4.30) 


Here aT em Is the coefficient agm in Eq. (2.6.1). The multipole coefficients Cyy ¢ defined here are 
the same as those defined by Zaldarriaga and Seljak, ref. 1. They are related to coefficients defined 
by Kamionkowski, Kosowsky, and Stebbins, ref. 1, by cf = Cu EEP NT, cE = Copal? /2T?, 
CF G = -C TE, IVIT; R , where J is the radiation intensity. (The superscripts G and C stand for 
“gradient” and “curl”, because the pattern of polarization vectors of type E or B resemble the pattern 


of velocity in potential or solenoidal flow, respectively.) 
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and hence the coefficients Cyy¢ are all real, while the Cyy, are all also 
positive. The results of measurements of correlations between tempera- 
ture and/or polarization fluctuations in different directions are generally 
reported in terms of the Cyy e. 

Another simplification provided by the decomposition of the Stokes 
parameters into E and B terms is that, as we will see, scalar perturba- 
tions can only contribute to the E terms. This result is important, because 
it means that any sign of a primordial B-type polarization’ will be clear 
evidence for cosmological gravitational waves, of the sort that we shall see 
in Chapter 10 are expected to be produced during inflation. 

We now turn to the calculation of the multipole coefficients (7.4.27)-— 
(7.4.29). The hydrodynamic treatment that worked reasonably well for 
temperature correlation functions is not well suited to the treatment of 
polarization, so we will rely on the more accurate kinetic theory outlined 
in Section 6.1. Here we must distinguish between scalar and tensor modes. 
As already mentioned, these do not interfere, so the multipole coefficients 
can all be written as a sum of a scalar and a tensor term, denoted by super- 
scripts S and T: 


Crye= Chrys + Ciri . (7.4.31) 


A. Scalar modes 


For scalar modes, the matrix Jj; in Eq. (7.4.6) is given by the Fourier integral 
(6.1.18) and the line-of-sight integral (6.1.36): 


arses ‘ SPOR s isa ae ag 
Jy (0, —ñ, to) = | d°q alq) dt exp| —iq- Pp 7 dt welt) 
ti © a(t’) t 


x |= (64 = hio) (40) = 0-00) 


3we(t e 
+ O 85 — Ae oCg.D 

3a@¢(t Bae Ne NAV E TE 
+ ot ng, nai —ni(q- i) G —nj(q- â) 


2we(t) 
a(t) 


[5;, — Ain; | Np Sux (q, J ; 


7Weak lensing by foreground objects converts the E-type polarization produced by scalar perturb- 
ations into a B-type polarization that may be large enough to interfere with measurements of the 
primordial B-type polarization; M. Zaldarriaga and U. Seljak, Phys. Rev. D 58, 023003 (1998)[astro- 
ph/9803150]. For a review, see A. Lewis and A. Challinor, Phys. Rep. 429, 1 (2006) [astro-ph/0601594]. 
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Since nje+ ;(7) = 0 and e, ;(7) e+ ;(7) = 0, the only term in the integrand 
that contributes to the Stokes parameters is the one proportional to ĝ;ĝj: 


i . N 3T P A nR 
oSA + iUS (A) = eri) e4; Â) fea ĝiĝj a (q) 


to to dt! 
x J dt P(t) TI (q, t) exp (a . af a) : (7.4.32) 
ti © a(t’) 


where P(t) dt is the probability that the last scattering occurs between ¢ and 
t + dt: 


t 
P(t) = w(t) exp (- J oyar) ; (7.4.33) 
t 


Asa reminder: a(q) is the stochastic parameter for whatever mode (presum- 
ably the non-decaying adiabatic mode) is assumed to dominate the scalar 
perturbations; œe(t) is the photon collision rate; tı is any time taken early 
enough so that a photon present at that time would suffer many collisions 
before the present; and TI (q, t) is a source function, given by Eq. (6.1.30) in 
terms of the partial wave amplitudes of the temperature and polarization 
perturbation amplitudes: 


Ng.) = AGa D + AP {G0 HAAD. (7.4.34) 


To use Eq. (7.4.31) to calculate the EE and TE multipole coefficients, 
we recall the familiar formula 


exp (ig v) = 4r SS ifje) YO) Ye" , 


l,m 


where here v = nig Jo dt'/a(t'). Acting on this, we can replace ĝ; with 
~id/dv;. Since 6 = fi, we can write 3/3v; = 7j9/dv + V;/v, where V is 
the angular gradient operator (7.4.17) acting on 7. Since nije; = 0 and 
e+ je+; = 0, Eq. (7.4.31) then reads 


Bhi Toe sci 3x T, A noe Pak 
O° (ft) + iU* (A) = - 5 es i(fi) e+ VIV) Di YPM) [ea alq) 
£m 
ey ae ic(aro) m 
x Y” O) [ t P(t) MG Gye (7.4.35) 


where here r(f) is the radial coordinate of a point from which light emitted 
at time ¢ would just reach us at present 


to dt! 
r(t) = Í ae (7.4.36) 
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(Note that V;/; doesn’t contribute here, because it equals ô; — A;f;, which 
vanishes when contracted with e4j;e,;.) Comparing Eq. (7.4.34) with 
Eqs. (7.4.15) and (7.4.16), we see that the scalar contribution to ap,em is 


3r Toit [(€+2)! fo 
S NOEN 3 mx 
dm=- pea | oof a (1.439 


Je (4 r(t )) 
q? r(A) 
To check the reality properties of ap em, we note that Eqs. (6.1.18) and 
(6.1.21) require that w*(q)I*(q, 1) = a(—q)I1(q, t), while ¥7"(q) = (=1) 
Y; ”™*(—ĝ), and of course i* = (—1)'7*, so OP om = aD _m: Inspection of 
Eq. (7.4.25) then shows that, as promised, the scalar modes contribute only 
an E-type polarization 


x POT, t) (7.4.38) 


a$ em = -aÈ em > aZ om = 0. (7.4.39) 


We can now give formulas for the scalar contribution to the EE and TE 
multipole coefficients defined by Eqs. (7.4.28) and (7.4.27). Recalling the 
normalization condition (7.2.5) of the scalar stochastic parameter a(q), we 
see from Eqs. (7.4.36) and (7.4.37) thatë 


2 
Ir TÈ (€ +2)! f” to Je (4 r0) 
bo A 0 2 PON l 
CEE, 16 am | q“ dq [ dt P(t) Tq, t) ee 


(7.4.40) 


Combining Eq. (7.4.36) with the general formula (7.2.4) for the scalar 
contribution to the temperature partial wave amplitudes, we find in the 
same way that 


[E+D! f” 
Che g=—30° Ty > q dq 


i ie(ar) 
x f dt PONG.) aH 
x Í : lie (4 r0) F(q, t) +j; (4 r0) Gq, D +i (4 r0) H(q, | 


(7.4.41) 


8This is the same as Eq. (17) of Zaldarriaga and Seljak, ref. 1, except that IT here is 4 times their TI. 


379 


7  Anisotropies in the Microwave Sky 
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Figure 7.4: The multipole coefficient £(£ + 1)C pe ¿/27 in square microKelvin, vs. £, for the 


cosmological parameters given in Eqs. (7.2.44) and (7.2.45). 
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Figure 7.5: The multipole coefficient £(£ + 1)C S ¿/27 in square microKelvin, vs. £, for the 
cosmological parameters given in Eqs. (7.2.44) and (7.2.45). 


where F(q,t), G(q, t), and H(q,t) are the quantities (7.2.7)-(7.2.9). The 
values of Chr , and Cr 7 calculated? for the cosmological parameters 
(7.2.44)-(7.2.45), are shown in Figures 7.4 and 7.5. Comparing these figures 
with each other and with Figure (7.3), we see that C or. 2<KC PE, pee A E 
The microwave background polarization is small, because the universe goes 
swiftly at the time of recombination from a state of nearly perfect thermal 
equilibrium to one of nearly perfect transparency, and photons are not 
polarized in thermal equilibrium. 

Eqs. (7.4.38) and (7.4.39) must be used for maximum numerical accuracy, 
but we can get a good idea of the £ dependence of the multipole coefficients 
for £ > 1 by using much simpler approximate versions. The function P(t) 
in Eq. (7.4.38) is sharply peaked at a time tz of last scattering, and it has 


*nttp://lambda.gsfc.nasa.gov/data/map/powspec/wmap lcdm_pl_model_yrl 
TECHE 
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{ dt P(t) = 1, so to a fair approximation we can do the integral over time 
by simply setting t = tz and dropping the factor P(t). Also, by using the 
same approximations that led us from Eq. (7.2.13) to Eq. (7.2.17), we find 


that for £ > 1 
9r? L (9 d £ 
Cire nm J p n ( f it) f 
1 


32r? ByE- Vr 


This does not give a reliable result for the magnitude of C> Ee because the 
function I1(q, t) varies rapidly around the time of recombination, but since 
this function falls off rapidly with g, so that the integral is dominated by 
b ~ 1, this approximation does suggest that the peaks and valleys in c3 EL 
arise chiefly from a factor I1*(¢/rz, tz). On the other hand, as remarked 
in Section 7.2, the peaks and valleys in Cre arise chiefly from a factor 


F*(€/rz), and by the same reasoning, the peaks and valleys in Che p arise 
chiefly from a factor F(€/rz)M(€/rz, tz). 

To get an idea of the g-dependence of TI (q, tz), we note first that, to the 
extent that the photon polarization arises solely from the last scattering, 
TI (q, £) would be given by just the first term A (q, t) in Eq. (7.4.33). This 
is because the differential cross section for the scattering of an unpolarized 
photon of initial momentum p with |p| < me by a non-relativistic electron 
to yield a photon with real polarization vector e is!? 


If the distribution of initial momenta p were spherically symmetric then the 
average of this cross section over initial directions would be independent of 
e, so the final photon would be unpolarized. But for a direction-dependent 
phase space density n, (p) of photons with momentum p at the position 
and time of last scattering, the intensity of photons after last scattering 
coming from direction # and having polarization vector e (with e? = 1 and 
e - ñ = 0) is proportional to 


[remo - (p-e)’) l 


The part of J; that contributes to the Stokes parameters is thus proportional 
to the traceless part of f d?p p ny,L(P)ÎiPj, which in turn is proportional to 
the £ = 2 part of Ae evaluated at the time and position of last scattering. 


l0See, eg., QTF, Eq. (8.7.40). This was first calculated by O. Klein and Y. Nishina, 
Z. Phys. 52, 853 (1929). 
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The additional terms Ay) (q,t)+ a (q, t) in TI (q, t) take account of what- 
ever polarization the photon may already have acquired by the time of last 
scattering, and can be expected to be relatively small since photons must be 
unpolarized under conditions of rapid photon scattering. But then, accord- 
ing to Eq. (6.1.32), 


N4, tr) ~ AP, D = mg (tL) /by Ctr) . 


Recall that, according to the first line of Eq. (5.1.43), ðið (x, t) is the term 
in the scalar part of 57"; that is not proportional to ô;j, while Eq. (B.50) tells 
us that for short mean free times, this term equals!! 


ði = a~? ny 3d Ñ , 


where n, is the shear viscosity due to photon momentum transport, given 
in terms of the photon mean free time t, by!? n, = #8 /yty, and di(x, r) is 
the gauge-invariant velocity potential !3 
- a&b 
ôu = ôu — aF + ——., 
2 

with F and B metric perturbations defined by Eqs. (5.1.32) and (5.1.33). 
Also, Eq. (7.1.45) (which was derived for a gauge in which the metric com- 
ponent F vanishes) gives the coefficient of the stochastic parameter «œ (q) in 
the Fourier transform of the gauge-invariant velocity potential in terms of 
the form factor G(q) as dug (t,) = —a(t_)G(q)/g. We conclude then that 


6q - 


1 


A E E 

? 45a? (t1) ” 
where f, is some appropriate average of t, (t) during the era of recombina- 
tion. Thus the peaks and valleys in Con ¿ and in \Cĵ¢, ¿| are more-or-less 
the same as those in |G(q/rz)|? and |F (¢/r_)G(q/rz)|, respectively, and we 
recall that the peaks and valleys in Cr. , are essentially those in |F (q/rL) ie 
Eqs. (7.2.23) and (7.2.24) give 


oO 


R t 
Fg) = Z [BRETO — A + RDT eh T S) cos x) | , 


11 Here we use the relation bul. j = buj:! = a? uij — a~*hyaa. 

12N, Kaiser, Mon. Not. Roy. Astron. Soc. 202, 1169 (1983). For a discussion, see footnote 5 of 
Section 6.4. 

13 We recognize di as the same velocity potential that appears in Eq. (7.1.37) as the Doppler contribu- 
tion to the temperature shift in the class of gauges with vanishing metric component F. In the special 
case of Newtonian gauge we have F = B = 0, and du = du. 
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& VIRE 
= “SOF RA 
where x(q) is an approximately linear function of q: 


ew Jo" Tat SC) sin x(q) , 


tL qdt 
w= f + Ak), 
OD Sys eR 
and the argument « of the transfer functions S, T, and A is proportional 
to g. As remarked in Section 7.2, the peaks in Cre are thus roughly at 
x(e/rt) = a, 2m, 3a, ... On the other hand, we expect the peaks in 


Chee to be roughly at the peaks in sin (x (£ /rt)) , or in other words, at 
X(£/rL) = 7/2, 37/2, 57/2, ..., and the peaks in ICFe el to be roughly at 
the peaks in [cos (x (e/r1)) sin (x (e/r1)) l or in other words, at x (£/rL) = 
w/4, 37/4, 57/4, ... In particular, we expect a peak in Cae before the 
first peak in Chr. p followed by one peak in Cop p between successive peaks 
in C ps while there should be two peaks in ICFz ¢l before the first peak 
in C igs followed by two peaks in ICP el between successive peaks in 


Cer. ¢ This is precisely the pattern seen in computer calculations.!4 For 
instance, for a plausible set of cosmological parameters there are indeed 
two peaks in ICP el (at £ = 36 and £ = 144) below the first peak in Cre 
at £ = 230. 

The E-type polarization and temperature—polarization correlation were 
first detected by the Degree Angular Scale Interferometer (DAST) collabora- 
tion.!> Then the Wilkinson Microwave Anisotropy Probe (WMAP) collab- 
oration measured the coefficients Crge with good accuracy over a range of 
multipole orders from £ = 2 to £ ~ 570.!° The results for £ > 10 are in good 
agreement with the concordance model, assuming primordial fluctuations 
with a spectrum close to the Harrison—Zel’dovich form R? « q~*/?, and 
a constant vacuum energy along with cold dark matter, baryonic matter, 
photons, and massless neutrinos, and using the same values of the cos- 
mological parameters Q ph, Qu’, h, and N as used in fitting the model 
to measurements of temperature anisotropies. In particular, there is clear 
evidence of the second expected peak in |Crg,el, at around £ ~ 140, and 
the third expected peak, at £ ~ 300. More recently, the QUaD collabora- 
tion!” has reported preliminary results from the first season of operation of a 


14For instance, see http://lambda.gsfc.nasa.gov/data/map/powspec/wmap_lcdm_ 
pl_model_vri_vl.txt. 

15]. Kovac et al., Nature 420, 772 (2002). 

16A. Kogut et al., Astrophys. J. Suppl. 148, 161 (2003). 

!7p Ade et al., 0705.2359. 
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microwave polarimeter at the South Pole, which show clear evidence for the 
first peak in Cgg,e at £ ~ 350, about where it is expected from scattering 
around the time of recombination. But if reionization were not included 
in the cosmological model, there would be a clear discrepancy between its 
predictions and the data for Crge for £ < 10. This discrepancy is attributed 
to polarization caused by scattering in a plasma that has been reionized by 
ultraviolet light from a first generation of stars. 

The most important application of polarization measurements so far has 
been in working out the history of the reionization of intergalactic matter at 
redshifts much less than zz ~ 1090, which contributes to the last-scattering 
probability distribution P(t). Because nearby events subtend large angles, 
the additional terms in Cgg and Crge due to reionization are negligible 
except for relatively small £, in fact £ < 10. After three years of oper- 
ation, the WMAP collaboration!® found on the basis of the EE correla- 
tion alone that if reionization is sudden and complete at a redshift z,, then 
7 = 10.9427, corresponding to an optical depth t = 0.09 + 0.03. This 
result is in line with expectations of the onset of star formation, and it has 
played an essential role in the use of temperature correlations to determine 
the magnitude of R4 and to set an upper limit on Då, discussed in Sections 
7.2 and 7.3, respectively. Scattering by the reionized plasma produces a 
peak in Cgg e much like the peak at £ ~ 140, but shifted to much smaller 
£, around £ ~ 4, because it occurs at an angular diameter distance much 
smaller than the angular diameter distance of the era of recombination. The 
WMAP measurements and other measurements of Cpg for larger values 
of £ are in good agreement with the theoretical formula (7.4.38) for Chr 2 
as shown in Figure 7.6, and future measurements are expected to further 
reduce the uncertainties in this comparison. 


B. Tensor modes 


For tensor modes, the matrix J; in Eq. (7.4.4) is given by the Fourier integral 
(6.6.9) and line-of-sight integral (6.6.24): 


LETS). [ea etx Bq, A) 


A=+2 


t t dt’ t 
x J dt' exp (-i of — -f dt” ot) 
ti v alt") r 


18L, Page et al., Astrophys. J. Suppl. Ser. 170, 335 (2007) [astro-ph/0603450]. 
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Figure 7.6: Observed values of the multipole coefficient (€ + 1)Cege/2m in square 
microKelvin, vs. £, from L. Page et al., astro-ph/0603450. (Note that Ceg¢/27 is mul- 
tiplied here with £ + 1, rather than the usual (£ + 1).) Dark squares are the WMAP data; 
the triangles are BOOMERanG data; the lighter squares are the DASI data; the diamonds 
are the CBI data; and the asterisk is the CAPMAP datum. The solid curve is the theoret- 
ical curve for (£ + 1)C S Et /2x, with cosmological parameters taken from a fit to WMAP 


temperature and low-é polarization data. 


x | — PrP (èy - pip) eri Â, à) Dg (t’) 
— welt’) Vq, N (ejN — PPren, %) 
— PiPKeik(G%) + PiPjDrPier Â, »)| ; 
Again using the conditions je, ;(7) = 0 and e+ ;:A)e4 i (ñ) = 0, we see that 


the only term in the integrand here that contributes to the Stokes parameters 
in Eq. (7.4.4) is the one proportional to eW. This term gives 


1 
OTA HUTA =- eE f aq pa 
A=+2 


i to ` [® dt 
saaa f aofia f E) Pavan. 044) 
ti © a(t’) 
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where (q, A) is the stochastic parameter for a gravitational wave of wave 
number q and helicity 4; P(t) is the last-scattering distribution (7.4.32); 
and W(qg,t) is a source function, given by Eq. (6.6.21) in terms of the 
partial wave amplitudes of the temperature and polarization perturbation 
amplitudes: 


Y(q, t) = aa, mee ADi Do Zaai E SAN t) 


6 Se 3 > 
+>=4Ap Ap 44 
7 3 (9,0) 70 4(qt) . (7.4.43) 


Also, e;(q, A) is the polarization tensor 
eij (ĝ, à) = Six(Q Si ekr Ê, a) » (7.4.44) 


where S; (ĝ) is the standard three-dimensional rotation (7.4.12) that takes 
the three-axis into the direction of ĝ, and e;;(Z, A) is the polarization ten- 
sor (5.2.15) for waves traveling in the three-direction, with non-vanishing 
components 


1 
Ro 
(7.4.45) 


e11, +2) =e (2, +2) = Fi e12, +2) = Fi e2 (, +2) = 


We will first show how to find the tensor multipole coefficients Ciri 
c} Bo and Cfr for a given source function ¥ (q, t), and then report the 
results of a numerical evaluation using W(gq, t) calculated as described in 
Section 7.3. 

We begin by deriving a formula for the coefficients ap ¿m10 the expansion 
of the Stokes parameters in spin-weighted spherical harmonics: 


OTA) +iUT@ = X ap em VEO. (7.4.46) 
£m 


Using Eq. (7.4.40) and the orthonormality property (7.4.21) of the V7’, we 
have 


T 
= J di V MA $ iU(A)) 


T to i 
Dpi BCG.) J dt P(t) W(q, À gem(4r 4 a) , 


t 


(7.4.47) 
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where r(t) is the unperturbed co-moving radial coordinate of a photon at 
time ¢ that reaches us at time fo: 


to dt! 
r(t) = n; 7.4.48 
osf (1.4.48) 
and 
Sump, 9,4) = J BA Ye (A) eS e4 iej Ae, A). (1.4.49) 
ij 


Itis convenient first to calculate the amplitude gem(p, ĝ, à) for ĝ along the 
direction 2 of the three-axis. Using the graviton polarization tensor (7.4.43) 
and the photon polarization vector (7.4.9), a straightforward calculation 
gives 


ey ie eE, #2) = —a(e4 118) + ie 9) = et = cosy 
Á oe (7.4.50) 


where as usual ¢ and 0 are the azimuthal and polar angles of the direction 
a. Eq. (7.4.22) shows that (A) is e~""® times a function of 0, so the ¢ 
integral in Eq. (7.4.47) vanishes unless m = A = +2, in which case it just 
gives a factor 27. For vF we will use the top line of Eq. (7.4.22), which 
gives 


(¢— 2)! 


2€ + 1 (€—2)! 
+2 +2 et2ie 
> Y; AA HE) 2 0 
Vi (0, p) arp t” (0,6) = C D+ 7(cosé) , 
(7.4.51) 
where ð+ are the differential operators 
d 2 d 
n= (GF 2000) yl a 
d go atl + 
=(1-p? PEE ; 7.4.52 
d=) Ir IF i ( ) 


where u = cos 6, and P? (cos 8) is the usual associated Legendre polynomial 


a 
Pw) = (1 ngare ; (7.4.53) 


Putting this together, for q in the three-direction 2, Eq. (7.4.47) becomes 


oes ss [e+ D (@-2)! fT PET 2 
Zem(P, Z, £2) = ôm, +2 8 af. du[duP?(u) |e (+p). 


(7.4.54) 
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We use (7.4.51) for the associated Legendre polynomial, use integration by 
parts to let the derivatives in 3+ act on (1 F j)*e’?”, and apply the formula 


a? ; 4(1 +u) 
ga (lH? F ea) + (+450 + — 


= (1 = p?)(12 + 8p Fm- °F BH) 2) 


re F peir 


Replacing u with —id/dp, we find 


1 m(2€+1)(€—2)! f! 
„2, +2) = 3m 44] du P 
Eem(P, 2, £2) = ôm,+2 3 CID u Pelu) 
32 


L [12 F 8i F w — Pr E w] a- u etor 
ðu 
[nQe+1)(€—2)! st} 
= —dm du P 
ne 3 CEDI u Pelu) 


2 a 
— F Bip F 2i? — 
dp 


2 
3p? 


The integral over u is now simply 


+1 f 
J -du eP Palp) = Who). 


A straightforward though tedious calculation using the defining differential 
equation jy (P) + (2/p)j,(p) + (1 — LE + 1)/p7))je(p) = 0 gives 


2 
PV 5. EHD) 
(+5 PHO = Gay py? 


so finally 


R ; fm (2l + 1 
Sem(P,Z, +2) = —2if m, +2 TEED 
Je(p) 


ə ear 
12 ep — p? 8 (aa 
«| PU +p gaT Sip + lig 


(7.4.55) 
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Just as in Eq. (7.3.13) for temperature fluctuations, the amplitude for q 
in a general direction is given by applying the standard rotation S(q) that 
takes the three-axis into the direction of q: 


Bem 0.4 £2) = YD (SQ) em'(P, 2, £2) (7.4.56) 
m 


where D“ is the spin-é unitary representation of the rotation group. Using 
Eqs. (7.4.45), (7.4.54), and (7.4.53), we have then 


pF ae ae A 
tam = Toit JAE 2 J abad DRS) 


f" dt P(t) Yq, t) 
t 
a 3? 
x [hiset -ot oia Ftio sai z|] ; 
k P CE p=q4r (t) 
(7.4.57) 


In order to separate the E and B terms in as em We need the reality 
properties of 6 and D. First, note that 


ej, £2) = (Si + iS20) (S1 +i) 


1 
val 
and 

Si@ + iSp(q) = (coso coso Fi sing, cos sino +i cos, — sin o) ; 
so 
e} Q, +2) = ej, F2) = ey(—G, +2) . 


With the gravitational field dominated by a single mode, D (£) must be 
proportional to D,(t), so that by absorbing any phase in 6(q, +2) we can 
choose D,(t) to be real. The reality of Dj(x, t) then requires that 


B*(q, +2) = B(—q, £2) . (7.4.58) 


Also, by writing D (s@) = exp ( — ip Js) exp ( — iJ?) where J,” 


are the angular momentum matrices!’ for angular momentum £, we can 


19 As noted in footnote 4, the phases in these angular momentum matrices depend on the phase 
+ A / A A 
convention chosen for the spherical harmonics. They are related by f any. ti (n)*LY, E A) = JEn 
where L = —ix x V is the orbital angular momentum operator. 
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show that 
D.(S@) = (-1)°D%), als) ; (7.4.59) 
The sign (—1)* here cancels the same sign in i~* = (—1)*7°. It follows that 


the whole effect of taking the complex conjugate of OD tm and changing 


m to —m, is to replace the differential operator acting on je(p)/p with its 
complex conjugate: 


26+ 1 
abr = Toit |= 22 J 0 604.42) DRS) 


x T dt P(t) Y (q, t) (7.4.60) 
t 
ð a? a lj 
x 4 [12+ 80— — P? +p- + Bip 4 2ip? — Jo) , 
00 3p 0p 2 
p=qr(t) 


The definitions (7.4.25) then give 


2£+ 1 z 
am = — 7 To f aq 6G, #2) DaS) 
£ 


to 
<f dt P(t) U(q,t) 
ti 
a a> |j 
el ge aye eee ke) (7461) 
dp dp> | p? 
p=qr(t) 


2e+1 : 
dim = i ToS Dot | da pa ED DaS) 
= 


pe 


to 
x J arowan | [80 + 20° > | (7.4.62) 
ti P qrit) 


0p 


We can now calculate the multipole coefficients, using the stochastic 
averages 


(Bq. A) B*(q’.4')) = Pq = g) > (7.4.63) 
and the unitarity relation (7.3.17): 
A An 
[ea Dy, »(S@) DY m is (s@) = = ggg orn See G 


The sum over helicities +2 just gives a factor 2 in the EE and BB 
correlations, while as expected in the EB correlation there is a complete 
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cancelation, due to the + sign in Eq. (7.4.60). The tensor mode contribu- 
tions to the polarization multipole coefficients are then?” 


[0,6] 
Chre = 2r f q dq 


2 
to 9 9 , 
x J dt P(t) Yq, t) {| 12 + 8p— — p? te ae 
4 0p 0p p 
p=qr (t) 
(7.4.64) 
CO 
ChB = «73 f na dq 
" > ð J jelo) : 
x J dt P(t) Yq, t) {[8 +20 =| Ho (7.4.65) 
2 oe] p p=qr(t) 


The results (7.4.59) and (7.4.60), together with Eq. (7.3.14), allow us also 
to calculate the multipole coefficients in the correlation between temperature 
and polarization. In C TE the + sign in front of the integral over wave 
numbers in Eq. (7.4.60) produces a cancelation between the two terms in 
the sum over helicities +2, so that as expected Cii = 0. On the other hand, 
in Ciz the sum over helicities +2 just gives a factor 2, so that Eqs. (7.3.14), 
(7.4.61) and (7.4.59) give 


(€+ 2)! 
Cpe = -20 To U-Di 


to 9 j ; 
<J dt P(t)W(q,t) $ | 12+ 8p— -P +P- a 
t dp 0p p pe 


ar j ie (ar p(w) F 
es Gaa (7.4.66) 
where 
t 
d(q,t)= [Pu = zaw] exp (- ° welt’) a’) ; (7.4.67) 
t 


The calculation of the gravitational wave amplitude D,(t) and source 
function W (gq, t) is described in the previous section. For the same cosmo- 
logical parameters as assumed there, Eqs. (7.4.62) and (7.4.63) give the EE 


20These formulas, and also Eq. (7.4.64), are equivalent to those of Zaldarriaga and Seljak, ref. 1. 
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Figure 7.7: The multipole coefficients €(€ + 1)C LE, p/2m and £( + Chee /2n in square 
microKelvin, vs. £, for cosmological parameters given in Section 7.3, from ref. 21. The 
results agree to about 1% with those obtained (adopting the same cosmological parameters) 
by employing the computer programs CMBfast or CAMB, in which, instead of using the 
integral equation (7.3.22) as done here, the Boltzmann hierarchy is truncated at a maximum 
value of £. (The results of these two computer programs also differ from each other by about 
the same amount.) 


and BB multipole coefficients shown in Figure 7.7. (We will come to the TE 
coefficients below.) 

The results (7.4.62)-(7.4.64) provide a basis for highly accurate computer 
calculations of the tensor multipole coefficients, but a casual inspection of 
these formulas does not provide much insight to the qualitative behavior of 
these coefficients as functions of £. In particular, looking at Eqs. (7.4.62) 
and (7.4.63), we could hardly guess that Cape < Cire for all £ > 15, or 
that C A g and C A g, approach each other for £ < 100, as shown for one set 
of cosmological parameters in Figure 7.7. 

These results become much simpler and more transparent for large £. 
Recall that for p? — v? >> v4/3 (where v = £ + 1/2), the spherical Bessel 
functions have the well-known asymptotic behavior?! 


ios a cos [vtan paps 7/4] i (7.4.68) 
vv sin 


UES, Gradshteyn & I. M. Ryzhik, Table of Integrals, Series, and Products, translated, corrected and 
enlarged by A. Jeffrey (Academic Press, New York, 1980): formula 8.453.1. The same approximation is 
used by J. R. Pritchard and M. Kamionkowski, Ann. Phys. 318, 2 (2005) [astro-ph/0412581], but their 
subsequent approximations are very different from those made here and in ref. 21. 


392 


7.4 Polarization 


where cosb = v/p, with 0 < b < m/2. On the other hand, for v? — 
p> > vt’, je(p) is exponentially small. In the range in which |p? — v?| < 
v+/3 neither approximation is valid, but je(o) is a smooth function of p 
in this range, without the singularity at ọ = v that might be suggested 
by the factor 1/v sin b in Eq. (7.4.66). For v >> 1 this range contributes 
only a small part of the range of integration, and we would expect to be 
able to use the approximation (7.4.66). For pọ? — v? > vt? > 1, the 
dominant contributions to derivatives of j¢()/” come from terms in which 
the derivative acts only on the cosine in Eq. (7.4.66), so that 

a Jp) 


a 
[+s -P+ 


> —je(e) +ji (p) 
dp ap? | p2 f 


d+ sin? b) cosb 
— cos | v(tan b — b) — 1/4 7.4.69 
ae a VERN. 
f >/anb 
sp + 22| mi —> 2j; (p) > a ysin reps? sin [vtan —b)— z /4| 
ap | p v 
(7.4.70) 


(Letting the derivatives act on 1/vsin b would produce a non-integrable 
singularity at b = 0, but this is spurious, because the asymptotic formula 
(7.4.66) breaks down for p very near v, where in fact there is no singularity.) 
Then Eqs. (7.4.62)-(7.4.64) become, for v = £ + 1/2 > 1,” 


272 oo 
T x To 2 
Cree = J2 [ q dq 


x J dt P(t) Y (q, t) 
r(t)>v/q 


2 
1 + sin? 
: | d+ MA cos b cos [vtan —b)- “| 
sin b cos b=v/qr(t) 
272 loa) 
er =f Pas J dt P(t) ¥(q, 1) 
> v? Jo r()>v/q 
2 
x [2vsinb cosb sin [vtanb — b) — 7/4]! EIET 
cosb=v/qr(t 
(7.4.71) 


22R, Flauger and S. Weinberg, Phys. Rev. D 75, 123505 (2007) [astro-ph/0703179]. 
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272 T2 oo 
Cte y ew í [ q dq 


x J dt P(Y (q, t) 
r(t)>v/q 


gs d+ sin? b) cosb 
sin b 


cos [vitano —b)— v1] 


cos b=v/qr(t) 


x J dt' d(q, t) 
r(t)>v/4q4 


3 
x | Soe cos [vtan bV -=b — ni] . (7.4.72) 


sin b' cos b'=v/q r(t") 
This approximate result for C/,. , is compared with the exact result (7.4.64) 
in Figure 7.8. 

For Ci , and Cha g» we can usefully make a further approximation. 


The quantity b = cos! (varo) does not vary appreciably within the 
relatively narrow range of times ¢ in which the last-scattering probability 
P(t) is appreciable, so we can set r(t) equal to rz = r(t) everywhere except 


in the phase v(tan b — b), which for v >> 1 does vary over a wide range in this 
time interval. Furthermore, because this phase varies over a wide range, the 


(1+1) Cfg; 
27 
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Figure 7.8: The multipole coefficient (£ + Cte ,/2 in square microKelvin, vs. £, from 
ref. 21. The solid curve is taken from the exact formula (7.4.64); the dashed curve represents 
the approximation (7.4.71). In both cases, the method of calculating the gravitational wave 
amplitude and the source function, and the assumed cosmological parameters, are those 
described in Section 7.3. 
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difference between cos[v(tan b — b) — 1/4] and sin[v(tan b — b) — 1/4] is 
inconsequential, and we may as well replace both with cos[v(tan b — b)]. 
Then Eqs. (7.4.69) and (7.4.70) become” 


[0,0] 
e+ 1)Chey > nT f q? dq {(1 + sin? br)? cos? br}cosb,=v/arz 


v/TL 


cos [vtan b— b)| 


x J dt P(t) Yq, t) 
(t)>v/q 


Vsinb i 
cos b=v/qr(t) 
(7.4.73) 
5 ee 2 2 2 
LE + 1)Chpe > n° To J q` dq {4 sin" br cos” bLheosbr=v/gri 
v/rL 
cos [vtan b— b)| 
x dP) VG, D 4-—-__—— 
Noses vsinb cos b=v/qr(t) 
(7.4.74) 


(We have not set b = bz in the factors 1/vsin b in both integrals over t, 
in order to avoid a divergence in the integration over q at q = v/ry.) The 
results of using these approximate formulas are compared with the results 
of using the exact formulas (7.4.62) and (7.4.63) in Figures 7.9 and 7.10. 

Our approximate result for Cop. ¿ agrees with the exact result to about 
1%, which is good enough for any practical purpose. The approximate 
result for Chr , İs not quite as good, agreeing with the exact result only to 
about 14%, but these approximations are evidently good enough to use them 
to draw qualitative conclusions. One immediate conclusion is that, since 
(1 + sin? by) > 4sin? b; for all real bz, we have Cie > Cge Also, for £ 
small enough so that the wave number £/rr comes into the horizon before 
matter—radiation equality, say £ < 100, for which W(€/rz, tL) is small, the 
integrals over q are dominated by values for which cos b is small, in which 
case (1 + sin? b)? ~ 4sin* b, and hence Cree 2 C$ p As already noted, 
both properties are evident (for at least one set of cosmological parameters) 
in Figure 7.7. 

The smallness of C A pg, Makes it a difficult target for future observations, 
but the detection of the primordial BB mode would be of great importance to 
cosmology, as it would provide clear evidence of cosmological gravitational 
waves. The expectations for such tensor modes in theories of inflation are 
discussed in Chapter 10. 
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Figure 7.9: The multipole coefficient £(£ + Cha. ¿[27 in square microKelvin, vs. £, from 
ref. 22. The solid curve is taken from the exact formula (7.4.63); the dashed curve represents 
the approximation (7.4.70). In both cases, the method of calculating the gravitational wave 
amplitude and the source function, and the assumed cosmological parameters, are those 
described in Section 7.3. 
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Figure 7.10: The multipole coefficient €(¢ + 1)CT, ,/2x in square microKelvin, vs. £, from 


EE,¢ 
ref. 22. The solid curve is taken from the exact formula (7.4.62); the dashed curve represents 
the approximation (7.4.69). In both cases, the method of calculating the gravitational wave 
amplitude and the source function, and the assumed cosmological parameters, are those 


described in Section 7.3. 


The multipole coefficients Cez, Cape, and Cre may be measured 
by using data on correlations among the Stokes parameters and tempera- 
ture fluctuations in comparison with general theoretical formulas for these 
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correlation functions,” analogous to the general formula (2.6.4) for the 


temperature correlation function. To derive these formulas, we start by 
noting from Eqs. (7.4.15) and (7.4.25) that 


a mr —M/ny\* 
20) = -3 D azn] eA + Ye" 
i mn =m k 
m7 arD -V OE (1.4.75) 
a i m^ =M ^n * 
VA =g aem VEG — Yea" 
1 mn —Mny* 
oa Litwin +9; (iy*] . (7.4.76) 
From Eqs. (7.4.28) and (7.4.29), we then find the correlation functions 
1 
(OM) OG) = 5Re J Cope (Fela, A) + Gola.) 
£ 
1 
+yRe D Cane (F,R) — Guinfl)) , (7.4.77) 
1 
(UM) UG) = SRe J Cree (Fela f’) — GA) 
£ 
1 
+5Re = Cope (Fei) + Geli f’)), (1.4.78) 
1 
(OG) UG) = 51m > Cere — Fela. A) + Gulf, i) 
£ 
1 
+51m 2 Cop.( — Fy, i’) — Geli i!) , (7.4.79) 


where 


Fea’) =S VAVA , (1.4.80) 


Ge) = OVA" A) (7.4.81) 


23General formulas for the correlation functions are presented by M. Zaldarriaga, 
Astrophys. J. 503, 1 (1998). 
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Also, using Eq. (2.6.2) (with aem now written as arm) and Eqs. (7.4.27), 
(7.4.74) and (7.4.75), the correlation functions between temperature fluctu- 
ations and Stokes parameters are 


(AT (A) OW) = — $ Cree ReH, i) , (7.4.82) 
£ 
(AT® UD) = ` CTE IMH Â, f) , (7.4.83) 
£ 
where 
HAAN =Y YAVA, (7.4.84) 


m 
with Y;” the ordinary spherical harmonic. 
To calculate the functions Fe(ñ, ñ) and Ge, à), we note first (for 
instance, by inspection of the last line of Eq. (7.4.22)) that, for 7 in the 
three-direction Z with 6 = ¢ = 0, 


24+ 1 
4n ` 


Then, using the definitions (7.4.16) and (7.4.11) and the transformation rule 
(7.4.18), we can express J,” as an element of a unitary rotation matrix? 


YE = > D Oy att’, (Sw), (7.4.86) 


m 


VHG) = êm, —2 (7.4.85) 


where S (ñ) is the rotation (7.4.12) that takes 2 into 7. Using Eq. (7.4.57), it 
follows then also that 
MA = (DVA . (7.4.87) 
Hence, by using the group multiplication property D©(S)D©(S’) = 
D©(SS’) and the unitarity of D® (S), we obtain addition theorems” that 
give us the functions needed in our formulas (7.4.76)—(7.4.78) for the corre- 
lation functions 
i> Bey 20 + 1 (CA) 
Fei = DS al ASA ) (7.4.88) 


Gi, #) = (— pee ey Da (a) S(— fl). (7.4.89) 


24E, Newman and J. Penrose, ref. 3; J. N. Goldberg, A. J. MacFarlane, E. T. Newman, F. Rorlich, and 
E. C. G. Sudarshan, J. Math. Phys. 8, 2155 (1967); K. S. Thorne, Rev. Mod. Phys. 52, 299 (1980). 

25 General addition theorems for spin-weighted spherical harmonics are given by W. Hu and M. White, 
Phys. Rev. D 56, 596 (1997) [astro-ph/9702170]. 
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To calculate the function Ay(7,n’), we recall that the ordinary 
spherical harmonics can also be written as elements of a rotation matrix. 
From the formula Y;"(2) = /@€-+ 1)/42 ômo for the case 0 = ¢ = 0 and 
the transformation rule (7.4.18), we have 


Yâ) = oe De AOKO) . 


Hence, again using the group multiplication property of the D® matrices, 
we find 


HÂ, à) = T (SASA). (1.4.90) 
Note incidentally that 
Penny D O VGH) = FRA) , (7.4.91) 
and 
G:(—A, —i’) z2 A VAN = AA) . (7.4.92) 


Also, using the property YMA) = (—1)f Lea 


Ay(-f, A) = SOY" @* VTAN) = HEA). (7.4.93) 
Hence the correlation functions have the reflection properties 
(OA OH) = (QÂ) QÀ), (7.4.94) 
(UA) URAN = (UÂ) UÀ) , (7.4.95) 
(OA UA) = —(Q(-A) U(—A’)) , (7.4.96) 
(ATA) OAN) = (AT(—f) O(A), (7.4.97) 
(ATA) UAN) = —(AT(-f) U (A), (7.4.98) 


as expected from the space inversion properties of the Stokes parameters 
and temperature fluctuation and the assumed space-inversion invariance of 
the probability distribution over which we average. 

It may be noted that, in measuring the correlation functions of the Stokes 
parameters observed in directions 7 and vi’, observers commonly choose the 
system of polar coordinates so that à and 7’ are on the same meridian; that is, 
so that ø = ¢’. (This is usually expressed as the condition that the first of the 
two polarization vectors that are used to define the Stokes parameter should 
be aligned with the great circle between fi and f’,*° but we have defined these 


26K amionkowski, Kosowsky, and Stebbins, ref. 1. 
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two polarization vectors always to lie in the directions of increasing 6 and œ, 
so it amounts to the same thing.) This has the advantage that the functions 
Fñ, A), Ge, A) and He, n’) are then all real, and depend only on 0 — 6’. 
To see this, note that by using the representation 


D® (s) = exp ( — iJ) exp ( = iJ?) 


of the standard rotation from 2 to ñ in terms of the angular momentum 
matrices Ie, we can rewrite Eqs. (7.4.87)-(7.4.89) as 


Fe â) = “ | exp (105°) exp (95°) 


x exp ( — ig IP) exp ( — io?) | , 
D) 


2+1 
ENT, eth) O 
Gulf, fl) = CDE È (i015) exp (iI) 


x exp ( — ilġ' + mJ) exp ( — ila — ug) | ; 


5 
— 204-1 catty 4 7) 
iy — 
Ay(n,n’) = i | exp (i075 ) exp (io ) 
x exp ( — ip’ IL? exp ( — io IS? | 
(10) (0) 
If ¢ = ¢’, then 
Ta 2t +1 ; O) 
Fe, A) = [ep i(0 — 0J; , 
An ( ) 2-2 


so Fe(n,n') for d = o’ depends only on 6 — 6’. Also, Tii is a real matrix, 
so Fe(ñ, ñ) for o = ¢’ is also real. As to Gg, for ¢ = d’ we have 


241 
An 
x exp (10.139) exp ( — ixJ4?) exp ( — ilr - us?) | 


¿2L+1 
4r 


Gel, À) = (-1) 


= ae) 


=(-1) 
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x exp (i015) exp ( + ila — e's?) exp ( — int) | 


ae ya exp (i - o+ mag) , 
-2—2 


a, ey) 


so Ge(ñ, A) for o = ¢’ is real because gy? is a real matrix, and depends 
only on 6 — 6’. Finally, He (ñ, in’) is real and depends only on 6 — 6’, for 
the same reasons as for Fe (ñ, 7’). We conclude then that if the coordinate 
system is chosen so that ¢ = ¢’, the correlation functions (Qñ) Q(n’)) and 
Dae one a toa 
ñ and ni’, while (Q) U(v’)) and (ATA) U (A) vanish. Also, (O) ON 

and (U (ñ) U(n’)) are obviously symmetric between ñ and 7’, so for ¢ = "i 
they actually depend only on |8 — 6’|; that is, on the angle between fi and ñ’. 
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8 
The Growth of Structure 


In Chapter 6 we followed the evolution of small perturbations through 
the radiation and matter dominated eras, up to the time of decoupling, 
when radiation no longer interacted effectively with matter. Now we will 
continue the story past the time of decoupling. In Section 8.1 we will 
follow the perturbations while they remained small, continuing our lin- 
ear analysis. This era is increasingly becoming accessible to observation, as 
studies of the cosmic matter distribution are pushed to larger redshifts. 
As we shall see, data on the distribution of matter fluctuations already 
provides an important extension of results from cosmic microwave back- 
ground anisotropies to smaller wavelengths, and it is hoped that eventu- 
ally it may provide information of the effect of dark energy on cosmic 
expansion. 

Of course, eventually the perturbations in the matter density became 
strong enough for the linear approximation to break down, as shown vividly 
by the existence of stars and galaxies and galaxy clusters. It is believed that 
these structures were formed in a two-step process.! First, in regions where 
the density was a little larger than average, the cold dark matter and bary- 
onic matter together expanded more slowly than the universe as a whole, 
eventually reaching a minimum density and then recontracting. This sce- 
nario is discussed in Section 8.2. If an overdense region was sufficiently 
large then as shown in Section 8.3 its baryonic matter collapsed along with 
its cold dark matter. Then in a second stage, after this collapse, the baryonic 
matter lost its energy through radiative cooling, and it condensed into pro- 
togalaxies consisting of clouds of gas that eventually form stars. The cold 
dark matter particles could not lose their energy through radiative cooling, 
so they remained in large more-or-less spherical halos around these galaxies. 
We will not attempt a proper treatment of this second stage, which involves 
complications of astrophysics and mathematics that deserve a treatise to 
themselves. 


8.1 Linear perturbations after recombination 


After the disappearance of almost all free electrons the baryonic plasma 
decoupled from photons, and behaved like just another form of cold 
dark matter. (Effects of pressure at small wavelengths are discussed in 


1S, D. M. White and M. J. Rees, Mon. Not. Roy. Astron. Soc. 183, 341 (1978). 
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Section 8.3.) With the gravitational field perturbations dominated by fluc- 
tuations in the total density of cold matter, the Newtonian treatment of 
Appendix F is applicable, and tells us that the fractional density perturba- 
tion eventually grew as a œ t?/>. However, we need a relativistic analysis 
to connect the constant wave-number dependent factor in this growth with 
the strength of primordial fluctuations, characterized by the quantity Rs to 
describe certain small but interesting oscillations in the density fluctuations 
arising from the interaction of baryons and radiation before decoupling, 
and to carry this analysis forward to near the present, when vacuum energy 
became important. 

In our relativistic analysis baryons and cold dark matter are treated 
somewhat differently, because we continue to use a synchronous gauge in 
which the cold dark matter velocity potential but not the baryon velocity 
potential vanishes. After decoupling the baryon velocity potential was no 
longer locked to the photon velocity potential by Thomson scattering, so to 
obtain the equation for baryon conservation, we may use Eq. (5.3.34) with 
PB, ÔP Bq» and x 3 7 all vanishing (because baryons move slowly), and find 


2 
50 Bq + 3H d0Bq = 7 Pad By = — PB Yq > 


or, dividing by pg « a7?, 


gq — (q°/a’)bupg = -Wa (8.1.1) 


where as before, 53, = ôpgq4/PB. For cold dark matter there is no velocity 
potential, so we again have Eq. (6.2.11) for cold dark matter 
conservation: 


bpq = — W (8.1.2) 


It is only our choice of gauge that makes the velocity potential of the dark 
matter rather than the baryons vanish, so it is best to think of dugg as the 
relative velocity potential of the baryons and dark matter. 

The equation for baryon momentum conservation is given by 
Eq. (5.3.32) with zero pressure and anisotropic inertia, 


do(OBSUBq) + 3H Padus = 0, 
so, dividing by jg « a~>, we have simply: 
bupg =0. (8.1.3) 


(As we will see in Section 8.3, our neglect of the baryon pressure after 
recombination is justified except for the smallest wavelengths.) Finally, if 
we neglect the contribution of photon and neutrino density fluctuations 
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to the gravitational field perturbation, then the gravitational field equation 
(5.3.36) becomes 


d : 
Shea) = —4r Ga? m ((1 — B)5pq + Bsq) - (8.1.4) 
where £ is again the constant 
O Q 
fo. = wie, (8.1.5) 
pm 2m 


We are keeping terms of order £ for the present, because as we shall see they 
lead to effects that although small are quite distinctive. 

To derive the initial conditions for these equations, we note that at times 
sufficiently early (say, z > 1), at which vacuum energy as well as curvature 
made a negligible contribution to the expansion rate, we have a œ f7/7, and 
4r Göm = 2/317, so Eq. (8.1.4) gives 


d 


© (ya) = -wela — B)8pq + Bösq) . (8.1.6) 


The reader can easily check that the general solution of Eqs. (8.1.1)-(8.1.3) 
and (8.1.6) is 


Wq = agt 1 + bgt? + Bbup,q"/a’ , (8.1.7) 


3 
ÔBq = -544th + bit +O = p)c4 — 3(1 = B)t(q? /a”)5upa » (8.1.8) 


Dq = -Zag + bgt”! — Beg + 3Bt(q?/a*)bupy , (8.1.9) 
where dg, bq, Cq, and ŝu gq are constants that must be found by matching these 
solutions to the values of Wy, 5g, Dq, and dug, at the time of decoupling 
of matter and radiation. For late times, we need only keep the leading terms 
in this solution, with coefficient ag; the other terms are suppressed relative 
to these by factors 1~°/3, 1-7/3, and t7!. To calculate the coefficient ay, we 
note that B5gq + (1 — B)Spq — tWq + Bt(g?/a7)dupg = —Saqt?/>/2. Setting 
t here equal to the time ¢, of decoupling” (when to a good approximation 


2The subscript L stands for “last scattering,” and indicates the time of the decoupling of radiation 
from matter, associated with the recombination of hydrogen. (To use a subscript R or D for this time 
might produce confusion with the subscripts R and D that we continue to use to denote radiation 
and dark matter.) Of course, this is not the moment of last scattering; some photons of the microwave 
background were scattered again when hydrogen became reionized at much later times. Strictly speaking, 
we should take ty here as the time during recombination when a typical electron stops exchanging 
appreciable momentum with the photons, rather than the slightly earlier time when a typical photon 
stops exchanging appreciable momentum with the electrons. Because Ry is not very different from 
unity, there is little difference between these times. 
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dBq = ôyq and dugg = duyq) then gives for t > tz 


Spy(t) > SD > -Zag 
aye ee 
= ie (=) [Bnn + (1 — L)ôna (tL) — tL Yq ltL) 
HBD uyl] : (8.1.10) 


Note that, even though the fractional density perturbations of the baryons 
and dark matter were quite different at decoupling, they approached each 
other at late times thereafter, an assumption we made in analyzing the obser- 
vations of X-rays from clusters of galaxies in Section 1.9. It follows that the 
fractional fluctuation 6 jy, in the total mass density approaches 

_ PDÔDq + PBÔBq 


Ma = = 2 — dpg > dR (8.1.11) 
i PD + PB 4 2 


Since these fractional density perturbations all eventually became equal, 
we will concentrate from now on ômq(t). We now want to carry our calcul- 
ation forward to the present, when dark energy may no longer be neglected. 
According to Eqs. (8.1.1)-(8.1.4), under the approximation of neglecting 
fluctuations in the photon and neutrino energy densities, ô mq (t) satisfies the 
second-order differential equation 


d| d 2- 
P7 | Sön = 4n Ga pub - (8.1.12) 
The most important consequence ofthis equation, together with Eq. (8.1.10), 
is that whatever we assume about dark energy, well after recombination 
when the terms in ômq(t) that decay as 1/t have died away, the dependence 
of q(t) on q and t factorizes: 


Suq(t) = AQ FO (8.1.13) 
where 
A(q) = Pôya (tL) + (l — B)Spq (tz) 
-tL Va (tL) + BtL(q" [az )duyq(tt) , (8.1.14) 
and F(t) satisfies the differential equation 
S Segr] -troan (8.1.15) 
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with the initial condition, that well after recombination, until dark energy 
becomes important, 


2/3 
Fi) > 3(=) , (8.1.16) 


For instance, if we assume that the vacuum energy is constant, then the 
Friedmann equation for a(t) reads 


E a +m) = HoV 2a VI+ Ix, (8.1.17) 


3 


where 


3 
PA LR OA (2) . (8.1.18) 


PM Qum \ao 


Using x instead of ¢ as the independent variable allows us to put Eq. (8.1.15) 
in a parameter-free form 


Jx(1 +x) a (x OAs ro) = ate (8.1.19) 


6x1/3 


The growing solution? that becomes proportional to 7/3 for x « 1 is 


T =f du 
x 0 ul/6(1 + u)3/2 © 


Eq. (8.1.16) requires that for x < 1 


so we can write 


t Q DY 
Fj=2 (2 ’) C ~ (£) ) l (8.1.20) 
aL Qu \ ao 
where C(x) is a correction factor, normalized so that C(0) = 1: 


C(x) = 2x6 TF f (8.1.21) 


= a ie 


Numerical values of C(x) are given in Table 8.1. 


3H. Martel, Astrophys. J. 377, 7 (1991). 
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Table 8.1: Values of the function C(x), giving the suppression of the growth of matter 
fluctuations by dark energy as a function of x = (Qa /Qy)(a Ja}. 


x C(x) x C(x) 
0 1 1.0 0.8725 
0.1 0.9826 1.5 0.8314 
0.2 0.9667 2.0 0.7981 
0.3 0.9520 2.5 0.7702 
0.5 0.9256 3.0 0.7462 
0.7 0.9025 3.5 0.7254 


We see that at all scales and times, dark energy suppresses the growth of 
density fluctuations. 

It is A(qg) that contains information about conditions at decoupling. In 
place of A(q), it is conventional to introduce a power spectral function P(k), 
defined as a function of the present value k = g/ao of the physical wave 
number, by* 


P(k) = 27) aR F’ (to) |A(aok)/? . (8.1.22) 


Most surveys of large scale structure report their results in terms of P(k). 

Now we must consider how these surveys are used to measure A(q) or 
P(k). We recall that for a single dominant scalar mode (and assuming spatial 
flatness), the fractional density perturbation in coordinate space takes the 
form of a Fourier transform like Eq. (5.2.1): 


sux =f dg acing e™ = FW f Py aq ame™ (6.1.23 
where a(q) is a stochastic variable, normalized so that 


(a(q)a*(q’)) = 8q -q^ . (8.1.24) 


The quantity A (q) can be found either from measurements of the correlation 
of matter density perturbations at different points, or more directly, from 
an angular average of the square of a Fourier integral of the matter density 
perturbation over the survey volume. 


4The factor (27r)? is inserted here because the position-space perturbations are usually written as 
Fourier transforms with an extra factor of (22)~3/ 2 and the factor a is included because these three- 
dimensional Fourier transforms are usually written as integrals over the present value of the physical 
wave number, k = g/ag, rather than the co-moving wave number q. If the reader wishes, the co-moving 
coordinates could be normalized so that ag = 1, in which case all gs in this section could be replaced 


with ks. 
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According to the ergodic theorem discussed in Appendix D, as long as 
the survey volume V is large compared with the volume over which density 
fluctuations are correlated, the ensemble averages of products of density 
fluctuations can be found from an average of these products over the survey 
volume, In particular, the two-point correlation function can be found from 


1 
(uD duly.0) = 7 J EE E A E AIS) 
V 


Of course, we measure density fluctuations as functions of redshifts and 
angular positions rather than of three-dimensional positions and times, so 
the correlation function actually measured is the correlation of the fractional 
matter density perturbation observed at a redshift z and direction 7 with 
the perturbation observed at a redshift z’ and direction ñ’: 


Ez Âz AD = Z (revi, 2) Si (reni, e’ )) , (8.1.26) 


where r(z) and ¢(z) are the Robertson—Walker co-moving radial coordinate 
and emission time associated with a redshift z. Using Eqs. (8.1.23), (8.1.24), 
and (8.1.13), this is 


EE Miz A = f dq MGO Bgl tte) exp (ia: CEA -re 


= F(«())F(«2')) J dq 1A (q)? exp (iq OL r(2')i')) l 


(8.1.27) 
In terms of P(k), this reads 
tae. rho) F(t) 5 
E(Z,n;Z,n)= E FI) j k P(k) dk 
sin (klds@a — dsni! ) 
x— (8.1.28) 


lds (2) — dseni 


where ds(z) is a convenient structure distance, related to r(z) and to the 
angular diameter and luminosity distances at a redshift z by: 


ds(z) = aor(z) = (1+ z)da(z) = (1+2)7!dz(z) . 


Eq. (8.1.28) can be used to measure the power spectral function if we know 
ds(z), or to measure ds (z) if we know something about features in the power 
spectral function. In particular, for observations at relatively low redshift, 
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we can use ds(z) ~ z/Ho and t(z) ~ fo, in which case Eqs. (8.1.28) gives the 
correlation function as 


sin ( /Ho)|zô = zil) 


Ez Az AN = ely [ «Pw dk (8.1.29) 
272 0 


zâ —z/n' 


Measurements of the shape of this correlation function can evidently tell us 
about the dependence of the power spectral function on k/Ho, rather than 
k itself. 

It is more common in large surveys to measure P(k) from the angular 
average of the square of a Fourier integral of the matter density perturbation 
over the survey volume. Define a Fourier transform as an integral over the 
co-moving survey volume V: 

V 3 —iQ-x 
dug) = aan J, dxe om (x,t). (8.1.30) 
(The reason for this normalization will soon be made clear.) Using 
Eq. (8.1.23), this is 


SuQ(t) = J d’q a(g)Fy 4 — Qô m(t) ; (8.1.31) 
where 


Fy@ = (8.1.32) 


1 
are dx eX ý 
VV (27)3/2 | V 


It is plausible, and will be shown formally at the end of this section, that as 
long as the co-moving survey volume V contains many co-moving 
wavelengths 2x /q, the angular average of |ô Mol is the same as its ensemble 


average 
l 27 V 2 — 1 2/7 V 2 
ale ô |k] -=(z fa Ò |k] (8.1.33) 
Using Eqs. (8.1.31) and (8.1.24), this is 


1 A 2 1 i 
a feo howl =~ f Pofta Fra- QP puol. 


(8.1.34) 
Now, for large V, Fy (q — Q) approaches ((27)*/*/./V)5(q — Q), so 
2 3/2 
Fra- Q> E sa- Q)Fy(0) =a- Q, (8.1.35) 


Wid 
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and therefore 
l 2A eV : 2 
z fa Ô ra =|sug|° , (8.1.36) 


where QO = |Q]. 

Again, we measure density fluctuations as functions of direction and 
redshift rather than position and time. Using Eq. (8.1.30) in Eq. (8.1.36) 
and expressing the result in terms of the power spectral function (8.1.22), 
we have 


1 A x 
Pk) = zy”? [ Preko 


x d! aao EÂ, t(z)) eP Okds @) f (8.1.37) 
EN . 
where Y = a V is the physical survey volume. As in the case of Eq. (8.1.28) 
for the correlation function, we can use this formula either to calculate P(k) 
from data for surveys with relatively low redshifts, or to find ds(z) from 
larger redshift surveys combined with information about P(k) from other 
sources. 
Let’s now consider the calculation of A(q) and the power spectral 
function. We begin by neglecting all terms of order £ in Eq. (8.1.14), so that 


A(q) = ônq (tL) — tr q(t1) - (8.1.38) 


We will return at the end of this section to the very interesting effects 
associated with the small baryon density. Using Eqs. (6.5.5) and (6.5.6), 
this is 
39707 ROT (Kk) - 2g?R°ET (x) 
Aq) = —~4—_ = —4 y (8.1.39) 
2ay 3H; az 


where 7 is the dimensionless transfer function given in Table 6.1, and «//2 
is the ratio of the wave number q to the wave number qgo that comes into the 
horizon just at matter—-radiation equality. (The second expression is derived 
using tz = 2/3H,, where Hy = /QyHo(1 + 2)?” is the Hubble rate at 
decoupling if radiation is neglected. This is more accurate than using the 
actual age of the universe at decoupling, because Eqs. (8.1.38) and (8.1.39) 
were derived using a definition of the zero of time for which a œ #?/3 during 
the matter-dominated era.) Using Eq. (8.1.20), the power spectral function 
(8.1.22) is then 


4(27)3agC?(Qa/Qu) 
2590, He 


P(k) = Rig kT? (WV 2k/keg) (8.1.40) 
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where, according to Eq. (6.4.58), 
keq = q6Q/a0 = V2HpQu/VQR = [13.6 Mpc} !Qyh? . (8.1.41) 


In particular, if we suppose that Ry ~ Nq~?/?(q/qx)"5~)/? with some 
spectral index ns and constant N (which for ns 4 1 depends on the arbitrary 
choice of the reference wave number q), we would have 
427)? N? C?(Qa/Qmu) 
2 pr4,ns-l 
ISL Hiks 


P(k) = k"ST?(/2k/keg), (8.1.42) 


where ką = qx/ao. The shape of this function for ng = 1 is shown in 
Figure 8.1. 

Knowledge of the power spectral function allows us to calculate the mean 
square value o? of the fractional density fluctuation: 


a; e 
2n? \ F(to) 


2 poo 
o7(z) = Ẹ(Z, ñ; z, ñ) = ) / P(k) k* dk , (8.1.43) 
0 

With P(x) given by Eq. (8.1.42), for any plausible ns this integral is conver- 
gent at k = 0, where T(x) — 1, but for ns > 1 it diverges at k — œ like 
{k"s~? In? k dk. (Recall that, as we saw in Section 6.5, 7 (<) œ ln k/k? for 
k — oo. On the other hand, the sine factor in the integrand in Eq. (8.1.27) 
makes &(z, ñ; z’, À’) finite for z’ £ z or A Æ À.) 

In order to avoid the ultraviolet divergence in o°, it is common instead 
to express the intensity of the primordial fluctuations in terms of the mean 
square value oR of the average of the fractional density perturbation over a 


KT)? 
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Figure 8.1: Shape of the power spectral function. This plot gives kT 2(«) as a function 
of x, where T(x) is the transfer function discussed in Section 6.5, k = V2k/kgQ, and 
keg = [13.6 Mpc]~! Q h? is the wave number that just comes into the horizon at radiation- 
matter equality. Eq. (8.1.42) shows that for ng = 1, the power spectral function P(k) is 
proportional to kT 2(K). 
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sphere of co-moving radius R/ao: 


2 
Die. 3a 3 
o2(z)= (E faat T) ) . (8.1.44) 


(Because of the translation invariance of the average, it would make no 
difference if we wrote the argument of ô my as x+y, with y any fixed coordinate 
vector.) Again using Eqs. (8.1.22) and (8.1.23), and now also Eq. (8.1.20), 
we find 


2 


C(24/2u A +2} oo 
ETO ea | POSER ak, 
2n(1 + z)? C(24/2m) 0 


(8.1.45) 


where f (kR) is the top hat distribution function 


3a3 . 
raaa J d3x elxa0 — (sin kR—KR cos(kR)) 
ao|x|<R 


3 
4r R? (kR)? 
(8.1.46) 
In particular, if we take P (k) to be given by Eq. (8.1.42), then 
leny? C?(2a/2ud +2?) 


V A (+2)? 


op(z) = 
x i 7 \T(V2k/keg)|? F(R)? k?t"S dk. (8.1.47) 
0 


The top hat function has f(0) = 1, so there is no change in the infrared 
convergence of the integral for the mean square fluctuation, but |f (kR)|? 
decays as 9 cos?(kR)/(KR)* for k — oo, which is fast enough to remove the 
ultraviolet divergence for ng < 5. 

Many observations of the distribution of matter in the universe are 
commonly expressed in terms of a quantity called og, which is the value of 
or(z) for z = 0 and R = 8h"! Mpc. In calculating og, we can evaluate 
the transfer function using the Dicus fitting formula (6.5.12), with the 
Eisenstein—Hu baryonic correction (6.5.22). Using the parameters A = 0.72, 
Qyh? = 0.14, Qh? = 0.024 and |N]? = 2.1 x 107!° which as discussed in 
Section 7.2 were found at the end of the first year of WMAP observations, 
and taking the present radiation temperature as Ty, = 2.725 K which yields 
Qrh? = 4.15 x 1075, Eq. (8.1.47) gives og = 0.92, in agreement with the 
result og = 0.919 found by the WMAP collaboration, which was calculated 
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using the CMBfast computer program.’ (The three-year WMAP data® 


gives a smaller value, og = 0761) oe: because the reduced optical depth 


inferred from three-year polarization data requires a smaller value of |N]? 
to give temperature correlations about the same magnitude. The reduction 
in |N]? also yields a corresponding decrease in the value of og calculated 
from Eq. (8.1.47).) 

Even though Eq. (8.1.43) gives a divergent integral for ns > 1, it sug- 
gests that P(k)k? can be used as a measure of the strength of the fluctuations 
of co-moving wave number q = aok. With P(x) given by Eq. (8.1.42) with 
ns = 1, we have F?()P(k)k? x tt kt|T (k/kgo)|?. This isa monotonically 
increasing function of k, so we can conclude that it is the perturbations of 
large co-moving wave number and hence small mass that become strong first, 
with small condensations merging into pre-galactic dark matter haloes and 
then ultimately into clusters of galaxies. This “bottom-up” picture of struc- 
ture formation is the reverse of the “top-down” picture long advocated by 
Zel'dovich,’ according to which very large condensations form first and 
then fragment into condensations on the scale of clusters of galaxies and 
finally into individual galaxies. The bottom-up picture is supported by the 
observation that the commonest galaxies are dwarf spheroidals, and that 
our galaxy and the Andromeda Nebula M31 each have about 20 smaller 
satellite galaxies. 

Returning now to the power spectral function, inspection of Table 6.1 
shows that the function «|7 («)|? has a maximum value of 0.74, reached at 
k = 2.0. Since Eq. (6.4.58) gives k proportional to x, with k = Qyh’«/ 
19.3 Mpc, P(k) for ns = 1 has a maximum at this value of x, 
corresponding to 


kmax = 0.10 Qyh? Mpc™!. (8.1.48) 


Taking C(Q4/ 2m) = 0.767 and ng = 1, the value of the spectral function 
at its maximum is 


—l 
Pmax = 7.2 x 10! (2u?) INI? Mpc? . (8.1.49) 


5D. N. Spergel et al., Astrophys. J. Suppl. 148, 175 (2003). The result og = 0.9 quoted 
in this reference was rounded off from og = 0.919, in order to reflect uncertainties in the input 
parameters. 

6D. N. Spergel et al., Astrophys. J. Suppl. Ser. 170, 377 (2007) [astro-ph/0603449]. 

TYa. B. Zeľdovich, Soviet Scientific Reviews, Section E: Astrophys. and Space Physics Reviews 3, 1 
(1984); S. F. Shandarin and Ya. B. Zel’dovich, Rev. Mod. Phys. 61, 185 (1989); and earlier references 
cited therein. 
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There are a number of measurements of the matter distribution at low 
redshift that have been used to calculate P(k): 


¢ There have been several surveys of galaxy positions and redshifts, of 


e 


e 


which the two most recent and detailed are the Sloan Digital Sky Sur- 
vey,® which will include about 800,000 galaxies, and the 2dF Survey,’ 
which at its completion in 2003 had included 220,000 galaxies. Results 
of the Sloan Digital Sky Survey are shown in Figure 8.2. 


These two surveys give values of P (k) in fair agreement with each other, 
for values of k ranging from about 0.0154 Mpc™! to 0.24 Mpc™!. 
(Wave numbers are given in these surveys in units ofh Mpc™! because 
distances are inferred from redshifts, and are therefore proportional 
to h~!.) The measured P(k) has a shape consistent with the result œ 
k|T (k)|? expected from Eq. (8.1.42) forns ~ 1, and in particular seems 
to reach a maximum at k ~ 0.024 Mpc™!, in good agreement with 
Eq. (8.1.48) for Qm + 0.3 and h ~ 0.7. The value at this maximum is 
measured to be Pmax © 5 x 104 h47? Mpc?. Comparison of this result 
with Eq. (8.1.49) indicates a value |N| ~ 2 x 1075 for the strength of 
primordial fluctuations, if we take Qm ~ 0.3 and h ~ 0.7. 


The counts of numbers of “virialized” clusters of galaxies (like the 
Coma cluster) as a function of redshift gives information about the 
distribution of mass with distance.!° (See Section 1.11.) Their results 
yield values for og ranging from 0.66 to about 1, and can be interpreted 
as giving a value P(k) © 6 x 103 ~3Mpc? fork © 0.1 h Mpc7!, which 
falls on the curve provided by the above galaxy surveys. 


Other information about the distribution of mass with distance comes 
from correlations between the positions of intergalactic regions of 
higher than average density, revealed through Lyman « absorption 
of light from distant quasars that passes through these regions.!! (See 
Section 1.10.) Their results yield values for P(k) for k between about 
0.1 h Mpc! and 6h Mpc™!, which lie on the curve provided by the 


8D. G. York et al., Astron. J. 120, 1579 (2000); M. Tegmark et al., Astrophys. J. 606, 702 (2004) 
[astro-ph/0310725]; Phys. Rev. D 69, 103501 (2004) [astro-ph/0310723]. The latest data release at the 
time of writing is analyzed by W. J. Percival et al., Astrophys. J. 657, 645 (2007) [astro-ph/0608636] and 
M. Tegmark et al., Phys. Rev. D 74, 123507 (2006) [astro-ph/0608632]. 

IW. J. Percival et al., Mon. Not. Roy. Astron. Soc. 327, 1297 (2001); M. Colless et al., Mon. Not. 
Roy. Astron. Soc. 328, 1039 (2001); M. Colless et al. (the 2dFGRS team), astro-ph/0306581. The final 
data set is analyzed in S. Cole et al. (the 2dF-GRS Team), Mon. Not. Roy. Astron. Soc. 362, 505 (2005) 
[astro-ph/0501174]. 

10For a summary with references to the original literature, see Table V of M. Tegmark et al., Phys. 
Rev. D 69, 103501 (2004). 
1LU, Seljak et al., Phys. Rev. D 71, 103515 (2005). 
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Figure 8.2: Measurement of the power spectral function by the Sloan Digital Sky Survey, 
from W. J. Percival et al., astro-ph/0608636. Dark circles show values of W? P(k), inferred 
from the survey of galaxy positions and redshifts, with distances calculated from redshifts 
using assumed cosmological parameters Qyy = 0.24, Qa = 0.76. Vertical bars indicate 
1 —o errors. The solid curve is calculated using linear perturbation theory, with cosmo- 
logical parameters taken from the WMAP third-year temperature and polarization data: 
h = 0.73, Qy = 0.24, Qn = 0.76, QR/Qy = 0.174. The normalization of this curve is 
taken from a fit to the data for k between 0.01 A Mpc7! and 0.06h Mpc7!. The departure 
of the data from this theoretical curve for large k is attributed to non-linear effects on the 
growth of perturbations. The inset shows the effect of baryon acoustic oscillations, discussed 
at the end of this section. Data points give the ratio of the measured power spectral function 
to its value when smoothed to eliminate the oscillations. The solid curve shows the expected 
ratio, calculated using parameters from the third year WMAP data. 


galaxy surveys for k < 0.2 h Mpc™!, and extend this curve to larger 


values of k, again in agreement with the expected shape « k|T(k)|?. 


¢ Yet more information about the distribution of mass comes from weak 
lensing of galactic images.!? (See Section 9.5). Their results yield 


!2See, e.g., H. Hoekstra et al., Astrophys. J. 647, 116 (2006). 
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values of og ranging from 0.67 to 0.97, and can be interpreted as 
giving values of P(k) for k in the neighborhood of 0.3 h Mpc™|, that 
fall on the curve provided by the galaxy surveys. 


These measurements are subject to various uncertainties, which raise 
highly technical issues of astrophysics. In relying on redshifts to give dis- 
tances, these measurements are vulnerable to complications arising from 
the peculiar velocities of galaxies or galaxy clusters or Lyman a clouds. 
The theory used to interpret these measurements assumes that the con- 
centrations of dark matter can be treated as small perturbations, so it 
is necessary to avoid using data for values of k where fluctuations have 
become non-linear. Finally, in interpreting the distribution of baryonic 
matter in galaxies or clusters of galaxies or Lyman a clouds in terms of 
the total mass density, it is necessary either to assume that this introduces 
no bias, or else to know what the bias is. Only the weak lensing tech- 
nique directly measures fluctuations in the total mass density. For this 
reason, although the other techniques can give good information about 
the shape of the function P(k), they leave its overall normalization rather 
uncertain. The results from the galaxy surveys quoted above are for “no 
bias,” that is, with the distribution of baryonic matter before perturba- 
tions become nonlinear assumed to trace the distribution of cold dark 
matter. 

One of the striking things to emerge from the study of the cosmic 
microwave background described in the previous chapter is that these results 
for P(k) obtained from studies of large scale structure agree with the strength 
of primordial fluctuations found from the cosmic microwave background. 
That is, the primordial fluctuation strength PIR? seems to be roughly 
constant over a wide range of physical wave numbers k = q/ao, from the 
values ~ 1073? Mpc! probed by observations of the cosmic microwave 
background down to all but the smallest multipole orders, to values more 
than 1 Mpc™! probed by studies of large scale structure. Indeed, even before 
the advent of the COBE measurements discussed in Section 2.6, the study of 
the large scale structure of matter had led to the expectation that the cosmic 
microwave background would show fractional temperature fluctuations of 
order 1075, corresponding to |N| ~ 1075. 

The measurements discussed so far were all at small or moderate red- 
shifts, where the structure distance in Eq. (8.1.28) is not very different from 
the linear approximation ds(z) ~ z/Hpo. It is widely hoped that measure- 
ments of the matter correlation function £ at larger redshifts will provide 
a determination of the functional form of ds(z) (or equivalently of d4(z)) 
beyond this approximation, which could illuminate the time dependence 
of the vacuum energy. But Eq. (8.1.28) shows that in the integral for 


E(z,n;z',n’) it is only wave numbers k ~ 1/r for which the integrand is 
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sensitive to the value of r = |ds(z)a — ds(z’)n'|. If r « 1/kgo (where 
kgo is the wave number that comes into the horizon just at radiation- 


matter equality) then for k x 1 /r and ns ~ 1, the approximate for- 
mula (8.1.42) for P(k) together with the asymptotic formula (6.5.11) for 
the transfer function give P(k) ~ k~3[Ink + O(1)[’, so the part of the 
integral (8.1.28) for the correlation function that depends on r has the 
r-dependence 


J “ sin(kaor)[Ink + 00), 
l/r 


which varies only logarithmically with r. Thus in order to use measure- 
ments of the matter correlation function to learn something about the 
z-dependence of ds(z), we need either to carry the measurements of the 
correlation function to redshifts and angles that are sufficiently different 
so that |ds(z)n — ds(z’)’’| is at least of order 1/kgEQ = 19.3 Mpc/ 
Qyh’, or take advantage of the small departures of the transfer function 
from the asymptotic formula (6.5.11) for k > kgo, or else take advan- 
tage of small departures from our formula (8.1.42) for P(k) at 
k > kgo. 

Such departures from Eq. (8.1.42) for P(k) are provided by terms of 
order 6 = Qg/Qy, produced by baryon acoustic oscillations before the 
time of decoupling.!? To see the effect of these oscillations, we must return 
to Eq. (8.1.14), and now look at the “fast” terms in the various perturbations, 
which oscillate with wave number for large wave number. In estimating the 
order of magnitude of these terms, we take pg(tz) and py (tz) to be of the 
same order of magnitude, so that R(t) is of order unity, and 87 Gp, (tz) 
is of order 6 H*(tz). In estimating orders of magnitude we also do not 
distinguish between ay and ago, which only differ by a factor of order 3. 
Then, keeping track only of the small factors 6 and a, Hz/q, Eqs. (6.4.45)— 
(6.4.48) show that, aside from a common factor Rọ exp(— f Idt), the 


13The influence of baryon acoustic oscillations before decoupling on the matter distrib- 
ution after decoupling was recognized by P. J. E. Peebles and J. T. Yu, Astrophys. J. 162, 815 (1970); 
R. A. Sunyaev and Ya. B. Zel’dovich, Astrophys. Space Sci. 7, 3 (1970); J. R. Bond and G. Efstathiou, 
Astrophys. J. 285, L45 (1984); J. A. Holtzmann, Astrophys. J. Suppl. 71, 1 (1989); W. Hu and 
N. Sugiyama, Astrophys. J. 471, 30 (1996); D. J. Eisenstein and W. Hu, Astrophys. J. 496, 605 (1998). I 
believe that the first to suggest using redshift surveys to measure d4(z) by observation of baryon acoustic 
oscillations were D. J. Eisenstein, W. Hu, and M. Tegmark, Astrophys. J. 504, L57 (1998), and that the 
suggestion to use this method to measure the evolution of dark energy is due to D. J. Eisenstein, in Next 
Generation Wide-Field Multi-Object Spectroscopy, eds. M. Brown and A. Dey (ASP Conference Series, 
vol. 280, 2002): p. 35. 
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fast terms in Bd,,(tL), (1 — B)dpq(tz), and tL Yq(tL) are of the order of 
B, BH7a7/q°, and BHyaz/gq, respectively, while Btr (q7/az)duyq(tz) is of 
order f (q/aL H2), and is therefore dominant (aside from Silk damping) for 
q/aLH > 1. Keeping only this term, we see that the oscillating part of 
A(q) is approximately 


AMSG) ~ Btr (g /az dur (tr) 


2BqR; tL 

~ — Tdt | si d ; 
VBa Hı (1 + R,)3/4 oe ( | ) = (1 nar) 

(8.1.50) 


where as usual R = 36g/(,; dy is the acoustic horizon distance (7.2.39) at 
decoupling 


d tL dt 
=a — 
3 Lf aJ30 + R) 


_ 2 In J1+ Rr +./REQ+ Rt 
Ap 3RLQu (1 + 213? 1+/REQ 


(8.1.51) 


and, for reasons explained earlier, we have replaced tz in Eq. (8.1.14) with 
2/3HL. Thus the ratio of P(A) to the smooth curve given by Eq. (8.1.42) 
will have bumps at k >~ wky/2, 3uky/2, 5aky/2..., where ky = 
1/dy (1 +z). These bumps have been seen in the spectral function inferred 
from observations by both the Sloan Digital Sky Survey!* and the 2dF 
Galaxy Redshift Survey.! (See the inset in Figure 8.2.) The propor- 
tionality of dy to Oe * has allowed a determination of 2m by matching 
the observed and predicted positions of these bumps!*!>. The result!® is 
Qu = 0.2565) as or with a larger sample,!? Qq = 0.24 + 0.02. Unfortu- 
nately, these measurements are at moderate redshifts (0.16 < z < 0.47 for 
the Sloan survey, and z < 0.3 for the 2dF Survey), and although they give 
evidence for dark energy, they do not yet provide information about its time 
dependence. 


14D, J. Eisenstein et al., Astrophys. J. 633, 560 (2005) [astro-ph/0501171]. 
155, Cole et al., Mon. Not. Roy. Astron. Soc. 362, 505 (2005). 

16W, J. Percival et al., Astrophys. J. 657, 51 (2007) [astro-ph/0608635]. 
17M. Tegmark et al., Phys. Rev. 74, 123507 (2006) [astro-ph/0608632]. 
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x k k 


As promised, we close by estimating the cosmic variance incurred in 
using Eq. (8.1.33). The mean square fractional error in this formula is 


aroo =| [52 (broo - (Prol )] ) 


x (O w pees ee 
< Mlok cal P| )- ( a, (ool) 


(8.1.52) 


Assuming Gaussian statistics, this is 


aC 
E E, oo |Z Q 


<| woo” mao ij f 100 by oo) | 


Using Eqs. (8.1.31) and (8.1.24) (and recalling that w(q) = a*(—q) and 
ômq(t) is real) then gives 


aC ZC / 
— ESE 


qFv(q — QO) Fý (q — QO) Six, 


The large volume limit Fy (q — Q) > ((27)3/2/./V)63(q — Q) then gives 
the cosmic variance as 


2(27)3 26 20 
avian = S EE [£2 7300-00). 


Now, F?(QO — QQ’) takes the value V/(Qn) for QJÔ — O'| « V~"3, 
and vanishes exponentially for Q|Q — Q'| > V~"/?, so aside from factors 
of order unity, 


Av (Q,t) © (8.1.53) 


1 
O2V273 ` 


Thus the mean square fractional error in using Eq. (8.1.33) vanishes as 
Q-?V~-2/3, and becomes negligible when the survey volume contains many 
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wavelengths. The numerical factor depends on the shape of the survey 
volume. For instance, if the survey volume is a sphere of co-moving radius 
R, then for OR > 1, 


Ay(Q,t) > (8.1.54) 


ee 
402 R2 ` 
8.2 Nonlinear growth 


The study of the growth of perturbations beyond the linear approximation 
presents formidable mathematical difficulties. It is usually pursued by the 
use of computer simulations, which are beyond the scope of this book. To 
allow an analytic treatment, we can adopt a generalization of an idealiza- 
tion of nonlinear growth originally due to Peebles.! With Peebles, we will 
consider a fluctuation to have an overdensity Apm (that is, a total density 
Om + Apm greater than the cosmic average py) that is uniform within 
a finite sphere.2 According to the Birkhoff theorem,’ the metric and the 
equations of motion of a freely falling test particle inside the sphere are 
independent of what is happening outside the sphere, and are therefore the 
same as in a homogeneous isotropic universe, described by a Robertson— 
Walker metric, with a density oj(t) + Apy(t), and a curvature constant 
that is not in general equal to the cosmological curvature constant K. Ina 
Robertson—Walker metric with curvature constant K, the scale factor a(t) 
satisfies the Friedmann equation (1.5.19): 


82 Ga?(t) 
3 


Likewise, the scale factor A(t) of the Robertson—Walker metric inside the 
fluctuation will satisfy a Friedmann equation: 


w(t) +K = (au $ pv) (8.2.1) 


82 GAZA) 
3 


where K + AK is the curvature constant of the interior metric. We are 
including in the total density both a non-relativistic mass density, and a 


RQ+EK+AK = (Go PAD pv) (8.2.2) 


1P. J, E. Peebles, Astrophys. J. 147, 859 (1967). 

2J, E, Gunn and J. R. Gott, Astrophys. J. 176, 1 (1972) took into account the infall of matter from 
outside a sphere of uniform overdensity, which amounts to treating the overdensity of the sphere and 
its surroundings as a step function of radius. A non-vanishing vacuum energy was incorporated in the 
Gunn-Gott model by H. Martel, P. R. Shapiro, and S. Weinberg, Astrophys. J. 492, 29 (1998), who 
considered fluctuations consisting of a ball with a uniform overdensity surrounded by a spherical shell 
with a uniform underdensity. 

3G&C, Sec. 11.7. 


421 


8 The Growth of Structure 


vacuum density py (not included by Peebles) which for want of any contrary 
evidence is taken to be time-independent. 

We will only consider the case of vanishing cosmological curvature,‘ 
K = 0. We are primarily interested in fluctuations that have a chance to 
stop expanding and recollapse to high density, so we shall take AK > 0. 
The total matter density obeys the conservation law 


Pau (t) + Apu) x A), (8.2.3) 
while the unperturbed density satisfies 
pu (t) xa (ft) . (8.2.4) 


With K = 0, the normalization of a is arbitrary; we will find it convenient 
to use Eqs. (8.2.3) and (8.2.4) to normalize a so that 


AO (du0 + Apu) = POMO . (8.2.5) 


In order to provide initial conditions for this problem, we must first 
consider times that are sufficiently early in the matter-dominated era so that 
Apm (f) and AA(t) = A(t) — a(t) can be treated as small perturbations, and 
the vacuum energy may be neglected. As we saw in Section 1.5, in this case 
Eqs. (8.2.1) and (8.2.4) have the solution 


1 
2/3 - 
Es, = 8.2.6 
ax pM = = op (8.2.6) 


To first order in the perturbations around this solution, Eq. (8.2.2) 
becomes 


24AA+ AK = 


82 Ga? py (= Sen) 
+ — 7 
3 a PM 


or, using Eq. (8.2.6) 


AK 4 (2 Apu) 4 AA 
a £ a ÖM 3ta ` 


Also, to first order in perturbations, Eq. (8.2.5) gives at early times 


4The case of negative cosmological curvature is considered by B. Freivogel, M. Kleban, M. N. 
Martinez, and L. Susskind, J. High Energy Phys. 0603, 039 (2006) [hep-th/0505232]. 
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so AA is governed at early times by the first-order differential equation 


AK 4 AA 4AA 
=— — : 8.2.7 
a? 92 a 3t a ( ) 
This has solutions for t > 0 of the form AA « 4/7 and AA « t7!/3, 
Assuming that enough time has passed for the solution AA œ t~!/? to die 
away, we have AA œ ¢4/3, and so 


20... 407G .. 
AK =— lim aA) = - lima? (Apm). (8.2.8) 
t—> t—> 
In characterizing the initial strength of a fluctuation within a co-moving 
radius r, we note from Eq. (8.2.8) that at early times Apjy(t) « a7? (t) œ 
TA (£), so we can define a time-independent quantity, which we shall call 
ie initial fluctuation strength: 


Ap? (t 
pai Pu (8.2.9) 
t—>0 Pry (t) 
Then Eq. (8.2.8) can be written 
L G 
AK = L2 ROP O 
_ 406 
ATE #(O(du() + Apu) “ol (8.2.10) 


Note that we do not have to take the limit £ —> 0 here, because ap, is 


time-independent. 

Now let’s consider the development of the fluctuation at later times, when 
it can no longer be treated as a small perturbation. Using Eq. (8.2.10) allows 
us to write the Friedmann equation (8.2.2) (with K = 0) as 


82 GA? 
9 


A= [Sau + Apu + pv) — 5m + ao?e] 
or, using Eq. (8.2.3), 
d 2 
(öm + Apm)? (eu + Apu?) = 81G 
x (300 + Apm + pv) —5(6m + Apa)? p 2 . (8.2.11) 


The right-hand side vanishes at a total mass density py + Apm = pe 
satisfying 


3(pe + pv) = 502P py. (8.2.12) 
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For pe > 0, the quantity 5 pa? —3 ¢ takes values from —oo to 5009 /243, 
so Eq. (8.2.12) has a solution if aid only if the fluctuation is strong enough 
so that? 


2; 
= 500° 


As long as Eq. (8.2.12) has a solution, the density m (t) + Apm (t) will drop 
until it reaches the value p,, and then increase again to infinity. The total 
time elapsed for this expansion and collapse is given by Eq. (8.2.11) as 


ni Oe 9 dp 
VSTE Nip py 30 + pr) — 5025p” 


(The factor 2 appears here because it takes the same time to contract from 
the minimum density pe to infinite density as it does to expand from infinite 
to minimum density.) For example, in the limit p1 > py, this gives 


On 
53/2,/8 Gp 


If linear perturbation theory held up to this time then instead of becoming 
infinite, in this case the fractional density perturbation at time te would be 


Apu p\'? 47 21/3 243m2 \ 
EEM: ng pee = 62 Gt)! = = 1.686. 
PM (a) pie Ea ( 500 ) 


o > (8.2.13) 


(8.2.14) 


te > (8.2.15) 


We see that gravitational collapse occurs at a time when linear perturbation 
theory would predict a fractional density perturbation large enough to make 
obvious its own invalidity. 

Of course, different fluctuations will have different initial strengths p1 
and co-moving radii R. In order to make contact between this analysis and 
observation, it is convenient to make use of an approach due to Press and 
Schechter.® By inverting Eq. (8.2.14), we can calculate the minimum initial 
fluctuation strength )(t-) required for collapse at or before a time te. For 
instance, for fluctuations that are sufficiently strong to collapse before vac- 
uum energy becomes important, Eq. (8.2.15) gives p1 (te) = 812/1000G?2. 
To calculate the probability that a random point in space will be in a fluc- 
tuation this strong, we assume that at an early time t, before non-linearities 
become significant, the probability P, r(Apm) dApy that the average den- 
sity within a co-moving sphere of radius R is increased by a density excess 


5S. Weinberg, Phys. Rev. Lett. 59, 2607 (1987). 
Sw. H. Press and P. Schechter, Astrophys. J. 239, 1 (1974). 
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between Apm and Apm +dApy is given by the Gaussian distribution 


dApm Apiy 
P, rR(Apm) dApm = — exp|-— < u__ |. (8.216) 
CENY M CYM Seon edu) ( 202E) O 


For a Harrison—Zel’dovich spectrum of fluctuations with Rg = Nq~?/?, the 
standard deviation op for the fractional fluctuation Apm / py averaged over 
a radius R is given by Eq. (8.1.47), which for ns = 1 reads 
lory? C2(2/2u 1 +2) 
2 774 2 
25922, H4 (+2) 


okz) = 
| j IT (V2k/kgo)l? |f (KR)? k? dk . (8.2.17) 
0 


where f is the top hat distribution function (8.1.46); 7 is the scalar trans- 
fer function, defined and calculated in Section 6.5; and kgo is the wave 
number (8.1.41) that comes into the horizon at matter—radiation equality. 
Using Eq. (8.2.9), this can be expressed as a time-independent probability 
Pr(o1) dp, for fluctuations averaged over a co-moving sphere of radius R 
to have a initial strength between pı and p1 + dp): 


2/3 
dpi P| 
Pr(pi) dpi = exp | ->= (8.2.18) 
sino, \" 22 


where õp is a time-independent quantity, given by 


5R = lim Sy (He) oRE) 


Lox? By -4/3 [> 2 213 
= SO p~ T 2 R 
25 (sa) Po Í IT (V2k/keQ)l? If (KR)|? k? dk 
(8.2.19) 


Because we are averaging over a sphere of co-moving radius R, it is only 
fluctuations whose co-moving radii are greater than R that contribute to this 
probability. Integrating, we see that at early times, before the fluctuations 
become strong, the probability that a random point in space is in a fluctu- 
ation with initial strength greater than p; and co-moving radius greater 
than R is 


s im [ dp = p23 
> > = — a Caer 
i o 3V 21 p2/36R : 252 
al VERE 
=5 [i — erf (pi /~[2õr) | ; (8.2.20) 


425 


8 The Growth of Structure 


where erf (y) is the usual error function 


erf(y) = dx exp(—x7/2) . (8.2.21) 


1 [m2 
sN 27 To 
At early times the universe has uniform matter density, so Eq. (8.2.20) 
also gives the fraction of all matter that is in fluctuations with initial strength 
greater than pı and co-moving radius greater than R. If we now set p1 in 
Eq. (8.2.20) equal to the critical initial strength p1 (t) for collapse by a time 
t, then we find that at time t, when nonlinearities have become important, 
the fraction of all matter that is in collapsed structures with co-moving radii 
greater than R will be 


F(> Rt) = ; [1 3 erf (p1 0/V2őr) | l (8.2.22) 


The mass in a sphere of co-moving radius R is the time-independent quantity 
An öm (Ða (t) R? /3, so at time t the number density n(M, t) dM of collapsed 
structures with mass between M and M + dM is given by 


uO d 


1/3 2 
IM ang (01 (0/V 26x) 
1/3 


PiP O MO) iO) 
Z = , (8.2.23 
Mon DT T Cone 


where R(M) = (3M /42a3py)!/>. (Press and Schechter somewhat arb- 
itrarily multiplied this by a factor 2, to take account of the matter in regions 
with a negative density fluctuation.’) These collapsed structures eventually 
furnish the halos of cold dark matter that surround galaxies in the present 
universe.® 

In using Eq. (8.2.23), we need to know the mass dependence of oR(). 
For large masses and radii, the integral (8.2.19) is dominated by low values 
of the wave number k, for which the transfer function T (x) is close to unity, 
so Eq. (8.2.19) gives  « R~*, and so Gy) x M~*/?. (For a primor- 
dial fluctuation spectrum Ry œ q4+”s)/2, we would have a factor k?+" dk 
in place of k3dk in Eq. (8.2.19), which would give oO œx R~3-"S, and so 


n(M,t) = 


zai 
d RM) 
dM 


7For a derivation of this factor of 2, see J. R. Bond, S. Cole, G. Efstathiou, and N. Kaiser, Astrophys. 
J. 379, 440 (1991). 

8The effects of non-spherical collapse on the mass function n(M, t) is considered by P. Monaco, 
Astrophys. J. 447, 23 (1995); J. Lee and S. F. Shandarin, Astrophys. J. 500, 14 (1998); R. K. Sheth and 
G. Tormen, Mon. Not. Roy. Astron. Soc. 308, 119 (1999); R. K. Sheth, H. J. Mo, and G. Tormen, 
Mon. Not. Roy. Astron. Soc. 323, 1 (2001). 
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FRM œx M-G+s)/3.) The argument of the exponential in Eq. (8.2.23) in 


this case is thus proportional for ns = 1 to M*/, giving a rapid fall-off of 
the number density for large mass. On the other hand, for small masses and 
radii, the integral (8.2.19) is dominated by large values of the wave number 
k, for which according to Eq. (6.5.11) the transfer function T (x) falls off like 
Ink/k*. The integral (8.2.19) then varies only logarithmically with R, and 
so the number density n(M) goes for small M more-or-less like M~*. The 
detailed M-dependence predicted by Eq. (8.2.23) is in reasonable agreement 
with the results of large computer simulations of the evolution of cold dark 
matter.” 


8.3 Collapse of baryonic matter 


Until now we have supposed that the baryonic matter of the universe, which 
after recombination consisted chiefly of neutral hydrogen and helium, had 
negligible pressure. In this case baryonic matter just followed along with 
cold dark matter in its expansion and possible recontraction. Actually, as we 
saw in Section 2.3, the baryonic matter retained a small residual ionization 
even after the nominal era of recombination, providing enough electrons 
for Compton scattering of photons of the cosmic microwave background 
to keep the temperature of baryonic matter equal to the temperature of 
the microwave background until the redshift dropped below about 150. 
Small overdense regions did not have enough of a gravitational field to 
overcome the baryonic pressure, so their baryonic matter did not collapse 
along with the cold dark matter. This led to relatively small clumps of cold 
dark matter that now do not contain galaxies, and are therefore undetectable 
except for their gravitational effects. The question is, how small did a clump 
of matter have to be for its baryonic component to have resisted gravitational 
collapse? 

Before the existence of cold dark matter was generally accepted, this 
question was addressed in a simple theory due originally to James Jeans.! 
According to this theory, small perturbations either oscillate or grow 
according to whether their wave number is greater or less than a criti- 
cal wave number ky = /427Gpg/v; (where vs is the speed of sound) so 
a clump is too small to collapse if its mass is less than the Jeans mass, 
given by 6g(27/k,)>. This theory naturally (given its date) was not orig- 
inally set in the context of an expanding universe, but not much changes 


ov, Springel et al., Nature 435, 629 (2005) [astro-ph/0504097]. 

1J, Jeans, Phil. Trans. Roy. Soc. 199A, 49 (1902), and Astronomy and Cosmogony (2nd ed., first 
published by Cambridge University Press in 1928; reprinted by Dover Publications, New York, 1961), 
pp. 345-350. 
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when the expansion is taken into account.” But as we shall see, it turns 
out that results are quite different when we include the effects of cold dark 
matter. 

We are considering only non-relativistic matter during an era in which 
radiation contributed little to the gravitational field, so we can apply the 
Newtonian cosmological theory described in Appendix F. By following the 
same reasoning that led there to Eqs. (F.15) and (F.16), but now including 
both baryonic matter with a squared sound speed 3pg/3pg = v2 and cold 
dark matter with zero pressure, we find that the velocity potential perturba- 
tions dug and dup and the density perturbations 60g and épp for co-moving 
wave number q are governed by the equations of continuity 


dépp 


+ 3H 8p =a lip q-sup = 0. (8.3.1) 
dé 
= L3H ipa apy op =O: (8.3.2) 
and the Euler equations 
déup 4r Ga 
nt Hou = -5 [50 + 508 (8.3.3) 
ddup 4r Ga ve 
+ Hôug = Ea F2 ôs] — s Spp, (8.3.4) 
dt apB 


As in the relativistic case, it is convenient to introduce the fractional density 
perturbations 6, = 50n/n. Using the Friedmann result that both unper- 
turbed densities 6, go as a7? œ t~*, and eliminating the velocity potentials, 
these equations then become 


4. 
5 —§ ô 1 — B)6 8.3.5 
p+ 5p = jal Bôn]. (8.3.5) 
vi 4. 2a 
ô —ôB = ô ô 1— B)dp], 8.3.6 
B+ 7 6B = — 3558 + 55 = [B55 + ( Bôn | (8.3.6) 
where œ and £ are defined by 

3 2 2/2 2:2 rs Q 

ac WO A. BEE EE OI, 37) 

2a? 4r Gma? PM QM 


and py = Pp + pp. 

Note that œ was constant during the era (roughly for z > 150) when 
baryonic matter had the same temperature as radiation, because v? « T « 
a~'. Thus we can find power-law solutions? to Eqs. (8.3.5) and (8.3.6) that 


>G&C, Sec. 15.9. 
3 Approximate power law solutions were given by P. J. E. Peebles, Astrophys. J. 277, 470 (1984). 
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apply during this era. We set 
bp xt” 7 op=Edp, (8.3.8) 


where v and & are time-independent (but q-dependent) quantities to be 
determined. Eqs. (8.3.5) and (8.3.6) then become 


4v 2 
vw-)+5 = 5[65 ie p)] (8.3.9) 


4v 2a 2 
vv —1)+ 3 + iis seta B/E] , (8.3.10) 
Eliminating € yields a quartic equation for v. This generically has four 
different solutions, so there are four independent power-law solutions of 
the fourth-order system of differential equations (8.3.5) and (8.3.6), which 
therefore form a complete set of solutions. 

The general solutions of Eqs. (8.3.9) and (8.3.10) are too complicated 
to be illuminating, but we can find useful approximate solutions if we take 
into account the small value of £. In the limit of very small £, there are two 
baryon-poor solutions for which £ < 1: 


1 
=2/3, = 8.3.11 
v=2/3, €= 75 (8.3.11) 
1 E ! (8.3.12) 
V = — = iD 
, l+a 
and two baryon-rich solutions for which € > 1: 
1 l 2a l+a 
Sy eee ee : 8.3.13 
ee BG B B eee! 


All of these solutions decay with time, except for the first solution (8.3.11), 
in which the baryon and cold dark matter fractional density perturbations 
both grow as 77/3. This solution therefore dominates at late times. To first 
order in £, the power-law exponent and baryon fraction in this mode are 


_ 2pa l$! pa? 
51 +a)’ os l+a (+` 


v=2/3 


The fractional baryonic corrections to v and € are a maximum for very 
short wavelengths, for which they take values 38/5 ~ 10% and 6 ~ 17%, 
respectively. 
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There is nothing here like a critical wave number marking a transition 
from growth to oscillation, as in the classic one-component Jeans theory. 
The growing mode (8.3.11) grows for all wave numbers, with a growth rate 
that for small 6 depends very little on wave number. This of course is 
because the pressure of baryonic matter could not prevent clumps of cold 
dark matter from becoming increasingly denser than average. What does 
depend on the wave number is the fraction £ of the baryons that follow the 
growing condensation of the cold dark matter. This can most conveniently 
be expressed in terms of the total mass M (dark and baryonic matter) in a 
cubic physical wavelength 27a/q: 


2 3 

VENT (=*) (8.3.14) 
q 

According to the formulas for £ and @ given by Eqs. (8.3.11) and (8.3.7), 

the fraction of baryons that collapsed along with the cold dark matter can 

be written 


60B/PB 
—=é§= ; 8.3.15 
Sonidp > T F MY ads 
where My is a sort of Jeans mass 
1\3/2 v2 
M; = (=) 5. (8.3.16) 
Pu 


Baryons collapsed along with cold dark matter for clumps of mass much 
greater than Myj, while clumps of mass much less than My are largely free 
of baryonic matter. 

The speed of sound for a gas of hydrogen and helium atoms at 
temperature T is vy = (5kgT/3umy)!/*, where u is the mean molecu- 
lar weight, which for a helium abundance by weight of 24% is u = 1.22. 
Since we are considering an era in which T = Ty, and T. 5 /em equals its 


present value T o/PM 0, the Jeans mass can be written 


5rkgTy o)? __ 
Mal eee Pury = 2.02 x 105 (2uh?) ~! Mo (8.3.17) 
3umy G 


For Rmh? = 0.13, this is 6 x 10° Mo, corresponding to a baryonic mass 
BM; ~ 10° Mo. 

After the redshift dropped to about 150, the Compton scattering of the 
cosmic microwave background by the residual ionized hydrogen no longer 
kept the temperature of baryonic matter equal to the radiation temperature. 
According to Eq. (1.1.23), the kinetic energy and hence the temperature of 
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the hydrogen and helium atoms then dropped like a~?. Additional baryons 
then began to fall into the already growing clumps of cold dark matter, 
until the baryons were heated again by the energy released in gravitational 
collapse, and eventually by the first generation of stars. The increased 
baryonic temperature inhibited the further accretion of baryonic matter, 
and may have resulted in the expulsion of some baryonic matter from the 
clump. This is all quite complicated,* but it does not change the conclusion, 
that a clump of cold dark matter whose mass is less than Mz will not contain 
a full complement of baryonic matter. 


4P R. Shapiro, M.L. Giroux, and A. Babul, Astrophys. J. 427, 25 (1994). 


431 


This page intentionally left blank 


9 


Gravitational Lenses 


In 1979 Walsh, Carswell, and Weymann! noticed a pair of quasars at the 
same redshift, about 1.4, separated by just 6”. The similarity of the spectrum 
of the two quasars suggested to them that there was really just one quasar, 
now known as Q0957+561, split into two images by the deflection of light in 
the gravitational field of an intervening massive body. This suggestion was 
verified with the discovery of a galaxy with a redshift 0.36 between the lines 
of sight to the quasar. Such gravitational lenses had already been studied 
theoretically by many authors, and now a serious search for them was put 
in train.? Many more lensing galaxies were discovered, generally elliptical 
field galaxies, and also some lensing clusters of galaxies, which generally 
produce arc-like images.* 

Strong gravitational lensing has been used to search for dark objects, to 
explore the structure of galaxy clusters, and to measure the Hubble constant. 
Weak lensing offers great promise in measuring the correlation function of 
density fluctuations. Lensing is a large subject; in this chapter we will give 
only an overview of its cosmological applications. 


9.1 Lens equation for point masses 


We will first consider the gravitational lens provided by a point mass, and 
later take up more detailed models. To analyze the splitting of images by a 
point mass, suppose that the lines from the earth to a point source and the 
earth to the lensing mass in a Robertson—Walker coordinate system centered 
on the earth are separated by a small angle a. (See Figure 9.1.) 

This is the angle that there would be between the images of the source 
and lens, if their were no gravitational deflection of light. Because of the 
deflection of light by the lens, there is a different angle 6 between the actual 
images of the source and lens. We need to derive a lens equation, which gives 
the relation between 6 and a. 

In the coordinate system centered on the earth, the light ray from the 
source follows a path from the source to the neighborhood of the lens that 


Ip, Walsh, R. F. Carswell, and R. J. Weymann, Nature 279, 381 (1979). 

2For a comprehensive review, see Gravitational Lenses, by P. Schneider, J. Ehlers, and E. E. Falco 
(Springer-Verlag, Berlin, 1992). A more recent review is given by G. Soucail, Proceedings of the XX 
Texas Symposium on Relativistic Astrophysics, Austin, December 2000. 

3R, Lynds and V. Petrosian, Bull. Amer. Astron. Soc. 18, 1014 (1986); G. Soucail, B. Fort, Y. Mellier, 
and J. P. Picat, Astron. Astrophys. 172, L14 (1987); G. Soucail et al., Astron. Astrophys. 191, L19 (1988). 
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Figure 9.1: Quantities referred to in the derivation of the lens equation, (9.1.5). The bent 
solid line is the path of a photon from the source S past the lens L to the Earth E. The point 
X is the apparent position of the source. The transverse distances b and c from the lens to 
the light path and from the source to its apparent position are greatly exaggerated, as are 
the angles a, 6, and y. This is drawn for the case K = 0; in a Robertson—Walker coordinate 
system centered on the Earth, the path from the source to the bend would be curved for 
IK] #0. 


is curved for K Æ 0; is bent by the gravitational field of the lens; and then 
follows a straight line to the earth. The proper distance b of this path from 
the lens at its closest approach (which we assume to be much less than the 
cosmological scale 1 /Ho) is 


b = Bd4(EL), (9.1.1) 


where d4 (EL) is the “angular diameter distance” of the lens as seen from the 
earth. As discussed in Section 1.4, in general the angular diameter distance 
d4(PQ) of a point Q as seen from a point P is the ratio h/@ of a proper 
length h at Q (normal to the line PQ) to the angle 6 subtended at P by this 
length. It is given by 


d4(PQ) = a(to)rp (Q) , (9.1.2) 


where rp (Q) is the radial coordinate of Q in the Robertson—Walker coor- 
dinate system centered at P (in which rays of light received at P all travel 
on straight lines), and fg is the time the light leaves or arrives at Q. From 
Figure 9.1, we can see that the line segment SX from the true to the apparent 
position of the source has a proper length c given by 


y d4(LS) =c= (£ g a) da (ES) , (9.1.3) 
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where y is the angle of deflection of the light ray near the lens as seen from 
the lens, given by general relativity as* 
4MG 
a, 9.1.4 
5 (9.1.4) 
and d4(LS) and d4 (ES) are the angular diameter distances of the source 
from the lens and from the earth, respectively. From Eqs. (9.1.1), (9.1.3) 
and (9.1.4) we have 


> yd4(LS) b = 4MGd4(LS) _ 9 
G —a)p ~ d4(ES) d4(EL) d4(ES)d4(EL) PE - GLa 


This is our lens equation. It is a quadratic equation for £, the two solutions 
giving the directions of the two images into which the point source is split. 
All the effects of the large scale spacetime geometry and the expansion of the 
universe are contained in the angular diameter distances d4(EL), d4(ES), 
and d4(LS). (In general these are independent distances; for K 4 0 we do 
not have d4(ES) = d4(EL) + d4(LS).) 

The lens equation (9.1.5) has two roots, 


2 
pe =Sts>+62. (9.1.6) 


The angle « is not observed, because we cannot remove the lens. If all we 
measure is the angular separation between the two images of the source, 
then all we can learn is an upper bound on the mass of the lensing galaxy: 


[B+ — BI? > 46} . (9.1.7) 


For instance, if d4(EL) = d4(LS) = 100 Mpc, d4(ES) = 200 Mpc, and 
the sources are separated by 1”, then (remembering that MoG = 1.475 km 
and 1 Mpc = 3.09 x 10! km), we find that M < 6 x 10° solar masses. 

On the other hand, if the lensing galaxy and the two images are all 
observed, we can measure the angles + between each image and the lensing 
galaxy, but with a unknown the best use we can make of Eq. (9.1.6) is to 
eliminate œ by multiplying the roots 


Repo Ser. (9.1.8) 


(The minus sign just means that the two images are on opposite sides of the 
lensing galaxy.) This allows us to calculate the mass M, but if the distances 


4G & C, Section 8.5. 
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are calculated from measurements of redshifts, then they scale with 1/Ho, 
so what we really calculate in this way is MGHpo. (As we saw in Section 
1.9, velocity dispersions and angular diameters also only tell us the value of 
MGA.) 

In the special case where the lensing galaxy lies directly on the line 
between the source and the earth, the problem has cylindrical symmetry 
around this line of sight, and we get an Einstein ring rather than a pair of 
images. The angular radius of the Einstein ring is the value of 6 given by 
setting a = 0 in Eq. (9.1.5), so it is just the angle Bg, which is why we label 
it with a subscript E. Einstein rings have been observed for a number of 
radio sources, starting with the source MG1131+0456.° 


9.2 Magnification: Strong lensing and microlensing 


The various images that are produced by a gravitational lens will not all 
have the same apparent luminosity. The apparent luminosity is the power 
received per receiving area, so now we need to consider light paths that 
end at various points on the telescope receiving area. For this purpose it is 
helpful to refer positions in the receiving area relative to some fixed point Y, 
which we can conveniently take on the axis of symmetry of the problem — 
the line extending from a point source (or a luminous point on an extended 
source) through the lens and past the earth. (See Figure 9.2.) We can 
think of the distance h of a point on the telescope mirror from this line as a 
function of 6, the angle at the source between the light ray to the point on 
the mirror and the fixed line (in the Robertson—Walker coordinate system 
centered on the source) from the source through the lens. The fraction of 
all light that is emitted between polar angles 6 and 6 + dé (with 0 « 1) 
and azimuthal angles ¢ and ¢ + dọ (measured at the source, around the 
fixed line to the lens) is 0 d0 dq@/4z, while the receiving area between in 
the rectangle with height dh and width hdo is hdhd®@, so the apparent 
luminosity is 


_ | Lod0d¢/4x 
-~ [hdhdo (1 + zs)? 


B L 
~ 4r (1+ zg)? 


a (9.2.1) 


a 


(The factor (1 + zs)~* accounts for the reduction of energy of individual 
photons and the reduction in the rate at which photons are emitted from 
the source.) From Figure 9.2, we see that 


h = d4(SE)x = d4(SE)d4(EL)a/d4(SL) , (9.2.2) 


5G. H. Chen, C. S. Kochanek, and J. N. Hewitt, Astrophys. J. 447, 62 (1995). 
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Figure 9.2: Quantities referred to in the derivation of the magnification formula, (9.2.4). 
The bent solid line is the path of a photon from the source S past the lens L to a point 
E on a telescope mirror. The point Y is a fixed point near the earth on the line from the 
source through the lens. The transverse distances b and A from the lens to the light path and 
from the point Y to the point E where the photon arrives at the telescope mirror are greatly 
exaggerated, as are the angles a, 6, x, and 0. As in Figure 9.1, this is drawn for the case 
K = 0; in a Robertson—Walker coordinate system centered on the earth, the path from the 
source to the bend would be curved for |K| £ 0. 


where x is the angle between the line from the source to the point on the 
mirror and the line from the source to the lens, and 


@ =b/d4(SL) = Bd4(EL)/d4(SL) . (9.2.3) 


(As in Section 9.1, for any points P and Q, d4(PQ) is the angular diameter 
distance of Q as seen from P.) Hence Eq. (9.2.1) gives 
d 
C= bo EE (9.2.4) 
ada 


where £9 is the luminosity that would be observed in the absence of the lens: 


L 
4m (1 +25)2d2 (SE) ` 


(Note that rs(E) = rz(S), so Eq. (9.1.2) gives d4(SE) = (1 + zs)d4(ES). 
According to Eq. (1.4.12), the luminosity distance of the source as seen 
from the earth is dy (S) = (1 + zs)?d4(ES) = (1 + zs)d4(SE), which is the 
distance whose square appears in Eq. (9.2.5).) 

Eq. (9.2.4) is a very general result, which applies to lenses and sources of 
all types. It has a particularly useful consequence for extended sources. Ifa 


(9.2.5) 


Lo 
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small part of the image of such a source subtends a solid angle B AB A@ and 
has an apparent luminosity AZ, then it has a surface brightness A€/B AB Ad. 
According to Eq. (9.2.4), this equals Afo/a Aa Ad, which is the surface 
brightness that this part of the image would have if there were no lens. 
Thus surface brightness is unaffected by lensing. This of course is because 
gravitational lensing changes neither the number nor the energy of photons. 

Let us now specialize to the case of a point lens. The lens equation (9.1.5) 
gives here 


a = $ — Br/B, (9.2.6) 
SO 

ada = _ 4 4 

gap 71 Belb (9.2.7) 


The luminosity (9.2.4) is then 
£0 


£= ——__—_.. (9.2.8) 
|1 — 6: /B4| 

When the distance! a of the lens from the line joining the source and 

observer is small, we havea < Bz, so the two solutions (9.1.6) for 8 become 


b+ > e+ ha. (9.2.9) 


In this case both images are amplified by a factor |6¢/2a|. On the other 
hand, when the distance a of the lens from the line joining the source and 
observer is large, we have a >> Bz, so the two solutions (9.1.6) for B become 


By —>a> pe, — B- > Bla X Be. (9.2.10) 


Under these conditions, the “—” ray becomes invisible, while the “+” ray 
has the normal brightness expected without gravitational deflection. 

Roughly speaking, therefore, a point mass can only produce noticeable 
strong lensing, with more than one image, ifithasæ < Bg, i.e., ifit lies within 
a proper distance dmax of the line between the source and the observer, 
given by 


4MGd 4(LS)d4(EL 
ana = Ba ( BL) = | MCHA AACE (9.2.11) 


'Hereaisa proper distance transverse to the line of sight, and is not related to the Robertson—Walker 
scale factor, which is distinguished in this section by always writing it with a time argument, as a(t). 
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Using Eq. (9.1.2), this is 


‘5 _ 4MGr_,(S)re(L)a(tt) = 4MGr_,(S)re(L)a(tz) 1 
aie re(S) z re(S) JIFZL 
(9.2.12) 


where fg is the time the light signal reaches the earth (elsewhere in this book 
called to), while zz is the redshift of the lens 


142, =a(tz)/a(tz) . (9.2.13) 


The radial coordinate rz (S) of the source in the Robertson—Walker coor- 
dinate system centered on the lens can be calculated in terms of the radial 
coordinates of both the source and lens in the Robertson—Walker coordi- 
nate system centered on the earth by using the condition that the three- 
dimensional proper distance between the source and the lens is equal in 


these two coordinate systems 

rL(S) rg(S) 
Í (9.2.14) 
0 re re(L) oe 


This gives 


ri(S) = rE(S)/ 1 — Kr} (L) — re(L),/ 1 — Kri (S) . (9.2.15) 


If at time ¢ there are n(t, M) objects per proper volume with mass between 
M and M + dM, then (using Eq. (1.1.12)) the total number of objects that 
can produce a detectable splitting of the image of the source S is 


re(S) 
Ns at | E dre Lath) [v= a>, (tr, M) dM 
1 — Kr? zL) 
re(S) drp(L) p(t) ri(S)re(L) 


0 a+ A-R "=O ” 


where tz is the time that the light from the source reaches the lens, and p(t) 
is the mass density of possible lenses at time t: 


= 4r Ga? (tg) 


(9.2.16) 


p(t) = f ra M)M dM . (9.2.17) 
0 


Also, in the integrand rg(L) and tz are related as usual by 


tE dt 


oe 9.2.18 
f sa -f a(t) ea 
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and 1 + zz = a(tg)/a(tr). If there is no evolution of the density of lensing 
galaxies, then p(t) = p(te)(1 + zz), and Eq. (9.2.16) reads 


TE(S) dyp(L) (1 + zz) (eve 
1 — Kr2.(L) rE(S) 


Ns = 4r Ga’ (tz) p (te) | ) (9.2.19) 
0 


Where Ns < | we can ignore the possibility of multiple lensing, and inter- 
pret Ns as the probability that the image of the source S is appreciably 
modified by a lens near the line of sight. 

For sources with zs « 1, Eq. (9.2.19) simplifies to 


, re(S) (re(S) —re(L))re(L) 
Ns = 4xGa%(tg)p(tz) Í drp(L) 
0 rE(S) 


_ WnGr;(S)a7(te)o(te) — 2xGzg pte)  Z%QL 
E 3 a): | re ae 


(9.2.20) 


where Qz = 8mp (tg)/ 3HE is the fraction of the critical mass that at present 
is provided by the lensing objects. We see that even for Qz as large as 
Qy ~ 0.3, the probability of strong lensing is small for nearby sources 
with zs < 1, but the probability should become appreciable for sources at 
cosmological distances.” 

For Zs of order unity or larger, the strong lensing probability turns out to 
depend sensitively on the cosmological model. Since the integral (9.2.19) is 
complicated in general, we will consider here just two extreme cases, of a flat 
universe dominated either by vacuum energy or by non-relativistic matter. 


De Sitter model (Q4 = 1, Qe = Qy = QR = 0) 


Here a(t)/a(tz) = exp (Ho(t — t(E))), so 


rg(L) = J £ = aT [exp (Hotte = t1)) = 1] 


We can invert this, and find 


a(te) 
1 + a(te)Hore(L) 


a(tL) = 


2W. H. Press and J. E. Gunn, Astrophys. J. 185, 397 (1973). 
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9.2 Magnification: Strong lensing and microlensing 
Then Eq. (9.2.19) becomes 


Ns = 4r Gp(te)d (tE) 
sg Í fe (e (L)(rg(S) — re(L)) 
0 


rE(S) 


It is convenient to change the variable of integration from rg(L) to the 
redshift zz of the lens, which is given by 


pe (£) EE SD 
a(t) 


) (1 + a(te)Hore(L))dre(L) . 


SO 


Ns = sn Gplz) k Gane — =) (1 + 21)d21 


He 0 ZS 
= wGp(te) 3 2\ _ QL 3 2 
5 era B +228) = (8 +223). (9.2.21) 


Einstein-de Sitter model (Q = 1, Qg = Qa = Qr = 0) 
Here a(t) = a(tz)(t/te)*/? and 1 + zz = (tz/t1)"/°, so 


E dt -1 2/3 ,1/3 
L) = — = 3a = 
rE(L) [ oO 3 (16) (te tp ty ) 


= 3a”! (te) te (1 atte ziy 
Hence Eq. (9.2.19) here becomes 


181 Gp (te) ty, T -1/2 -1/2 
Ns = dzı (1-1 2) a / 
Sraz Jo et ( eee K g 


x (a +2907"? -1 +23") 


= 82 Gp(te) (-4+ Vl+zs+1 in (1 +2s)) 


He J1+zs —1 
Jl+zs+1 ) 
= 3Q, | —4 + ————_In(1 + 9.2.22 
( s n(1 +25) (9.2.22) 


This grows like ra for zs < 1, in agreement with Eq. (9.2.20), but it flattens 
out for zs > 1, growing only logarithmically with zs. 
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We see that the probability of a given source being lensed increases 
much more rapidly with the redshift of the source for a vacuum-energy 
dominated model than for a matter-dominated model. For instance, for 
zs = 2, Eq. (9.2.21) gives Ns = 4Q 7 for a vacuum-dominated model, 
while Eq. (9.2.22) gives Ns = 0.30 Qz for a matter-dominated model. 

Comparison of lensing theory (under the assumptions K = 0 and con- 
stant vacuum energy) with the number density of strong gravitational lenses 
measured as a function of redshift by the Cosmic Lens All Sky Survey? has 
given the result that 


Qm = 0.314971 (68% stat.) Th) (syst.) . 


This is not yet competitive with other measurements of Qm, but it shows 
definite evidence of the effect of dark energy on the number of observed 
strong lenses. 

The assumption that the lens can be approximated as a point mass is 
appropriate for so-called microlensing by stars in our galaxy. Note that for 
d4(EL) 103 pe, d4(LS)  d4(ES), and MG ~ 1km ~ 3 x 107!* pe, 
the effective radius af 4MGd4(EL)d4(LS)/d4(ES) of the point mass lens 
is of order 1075 pc, which is much larger than the size of even a large 
star. Typically in microlensing observations a change is detected in the 
luminosity of a distant point source (such as a star outside our galaxy) as 
the star moves past the line of sight to the source. For a star that moves 
at 100 km/sec transverse to the line of sight, the time within which the 
source will be within the effective radius 107% pe of the star is about a 
month, which is convenient for monitoring changes in apparent luminosity. 
Between 13 and 17 microlensing events toward the Large Magellanic Cloud 
(LMC) have been seen by the MACHO collaboration,* and another three 
toward less crowded fields of the LMC by the EROS collaboration.» Of 
these, only four of the lenses have been identified, and all of them are in 
the LMC itself, rather than the halo of our galaxy. This suggests though 
it does not prove that the mass of the halo does not consist of dark objects 
with the masses of typical stars.’ Microlensing has also been extensively 
used in searches for dark stars and extra-solar planets by the Microlensing 


3K.-H. Chae et al., Phys. Rev. Lett. 89, 151301 (2002) [astro-ph/0209602]. 

4C, Alcock et al., Astrophys. J. 542, 281 (2000). 

ST. Laserre et al., Astron. and Astrophys. 355, L39 (2000). 

6N. W. Evans, in IDM 2002: The 4th International Workshop on the Identification of Dark Matter, 
eds. N. Spooner and V. Kudryavtsev (World Scientific) [astro-ph/021 1302]. 

7For discussion of this issue, see A. F. Zakharov, Publ. Astron. Obs. Belgrade 74, 1 (2002) [astro- 
ph/0212009]; K. C. Sahu, proceedings of the STSci symposium Dark Universe: Matter, Energy, and 
Gravity [astro-ph/0302325]. 
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Observations in Astrophysics collaboration® and the Optical Gravitational 
Lensing Experiment collaboration.” 


9.3 Extended lenses 


The point source model is not at all valid for clusters of galaxies, and it 
is only marginally valid for individual galaxies. (Note that for d4(LS) ~ 
d4(EL) ~ 10!° pe, d4(ES) ~ 2 x 10!° pe, and MG ~ 10!*km ~ 0.03 pe, 
the effective radius /4MGd4(EL)d4(LS)/da(ES) of the point mass lens 
is of order 5 x 104 pc, which is close to the size of the spherical halo of 
our own galaxy.) To deal with light rays that pass through the galaxy, it 
has become common to approximate the massive halos of these galaxies as 
spheres of matter in “isothermal” equilibrium, with a ratio of pressure p to 
mass density p given by the mean square value (v?) of any one component 
of star velocity, assumed to be equal throughout the lens: 


P(r) = p(r)(v’) . (9.3.1) 


We have already seen in Section 1.9 that the solution of the equation of 
hydrostatic equilibrium with this equation of state behaves at a large proper 
distance r from the center of the lens as 

2 p(0) (u?) 


p= 9(r/r0)? T In Gre’ a2) 


which gives the mass contained within a sphere of radius r as 


2r (v2) 
G 


The solution departs from this result for small r, but most of the mass of 
the lens is at distances from the center where it is a good approximation. 

The rate of change of the unit vector û giving the direction of a ray 
of light in a gravitational field g = —V¢ is given in the post-Newtonian 
approximation by! 


Mir) = 


(9.3.3) 


A = 2x xg). (9.3.4) 


8P, Yock et al, Proceedings of the ninth Marcel Grossman meeting, Rome, July 2000 [astro- 
ph/0007317]. 

9A. Udalski et al., Acta Astron. 54, 313 (2004) [astro-ph/041 1543]. 

1 See, e.g., G&C, Eq. (9.2.7). A factor —2 that was missing on the right-hand side of this equation 
has been supplied. 
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For any spherically symmetric non-relativistic distribution of matter, we 


have GMC) 
r 
g(r) => r3 r, (9.3.5) 
which for the outer parts of an isothermal sphere becomes 
2(v?) 
g(r) = — 2 r, (9.3.6) 


For a light ray that passes the center of the lens at a distance b of closest 
approach, the light ray direction û remains very close to a fixed direction. 
The small change in direction then has a magnitude 


© bdx 
y = 4?) T a An (v?) , (9.3.7) 
and is independent of b. (We are taking the speed of light to be unity, so that 
(v*)!/2 is dimensionless, equal to the rms velocity in units of the speed of 
light.) The direction of this change is toward the center of the lens, so light 
can arrive at the earth from the source along two different rays, which pass 
on opposite sides of the lens center, and are separated in angle by 87 (v’). 
The “+” ray, which passes the lens on the side toward the source, makes 
an angle 61 at the earth with the ray from the earth to the lens given again 
by Eq. (9.1.3), while the ‘—’ ray makes an angle with the earth—lens ray B_ 
(now taken positive), given by replacing œ with —« in Eq. (9.1.3), so 


yd4(LS) = (p+ F œ) d4(ES) (9.3.8) 
which with Eq. (9.3.7) gives us our lens equation 
Bs = +a + Be, (9.3.9) 
where now 
4r (v*)d4(LS) 
= —_____.. 9.3.10 
BE (ES) ( ) 


Here again for a lens on the line of sight from the earth to the source we 
have a = 0, and the two images become an Einstein ring with angular 
radius Bg. Because we have now defined £+ to be positive the lens equation 
gives two images only if a < Bg, which requires that the proper distance 
a = d,4(EL)a of the lens from the line between the earth and the source be 
less than a maximum value 

4r (v*)d4(LS)d4(EL) 


dmax = Bed4(EL) = LES) (9.3.11) 
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The magnification of the two images is again given by Eq. (9.2.4) 


pdp 
ada 


but now we must use this with the lens equation (9.3.9), which gives 


Bs 
Bs — BE 
The magnification becomes infinite as the lens approaches the line of sight 
between the earth and the source, in which casea —> 0, B+ —> Be, and 
the image pair becomes an Einstein ring. On the other hand, for a lens 
approaching the maximum distance (9.3.11) from the earth-source line, we 
havea —> Br, so p+ —> 2a and B_ — 0. According to Eq. (9.3.12), the ‘“—’ 
ray disappears in this limit, while the apparent luminosity of the ‘+’ ray is 
doubled. 

The number of possible isothermal spherical lenses that are actually able 
to split the image of a point source is now given by 


L= lo 


5 


L = bo l = t (9.3.12) 


rg(S) 
Ns= Í EO) drg(L)a(tr) a ALLS ee 
1 — Kr2.(L) 
re(S) 3 2 
z 162302? | s HED n(ty)a° (tL) (20) (93.13) 
0 re(S) 


\/ 1 — Kri (L) 


where n(t) is the proper number density of these lenses at time t. For sim- 
plicity, we have here taken (v?) the same for all lenses. If we also now take 
K = 0, so that Eq. (9.2.15) gives rs(L) = rg(S)—rg(L), and also assume no 
evolution of the population of lensing galaxies, so that n(t)a? (t) is constant, 
then 


re(S) ( (re(S) ~ re(L)) re) 


Ng=16n3(v2)2n(tg)a(te)3 Í tn 
0 re(S) 
_ 817 (v)’n(te)a(te) rz (S) 
T 15 
872 (v2)2n(tE) 
= Nee 3.14 
5H? F(zs), (9.3.14) 
where 
3 1+z dy 3 
F(z)= tr) H, S = . (9.3.15 
(2) = (a(te)Hore(S)) ( / aaa) (9.3.15) 
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This shows that the chance of a source being lensed depends strongly on 
Qa. For instance, for QA = Qe = 0 and Qy = 1 we have F(z) = 
8(1 — 1/ VI + 2)°, which approaches the constant 8 for large z, while for 
Qy = Qpr = l and Z; = 1 we have F(z) = z>. The lensing probability for 
a source at redshift z = 3 is 27 times greater if the cosmic energy density is 
vacuum dominated than if it is matter dominated. From direct observations 
of galaxies, the constant in Eq. (9.3.15) has been estimated as? 


822 (v2)2n(tE) 


= 0.02. 9.3.16 
15H} i ! 


so for Qm = Qr = Qand Qa, = 1 the probability of lensing should become 
large when z © 3.7. 

Lensing probabilities for quasars of large redshift can provide sensitive 
limits on Q,.3 At one time it was thought on the basis of early surveys that 
strong lensing statistics ruled out the possibility that a constant vacuum 
energy made a dominant contribution to producing a spatially flat universe.* 
Since then, many groups have carried out such studies, with corrections for 
finite core radii, selection effects, etc., to set limits on Q, in flat cosmologies 
with 24 + Qu = 1. They find Q4 < 0.9,5 Qa < 0.76 2a = 0.644)32,7 
Qn < 0.748 Qa < 0.79,9 Qa = 0.781,10 and Qa ~ 0.45-0.75.!! 

The deflection A0 calculated in the isothermal sphere model of 
gravitational lenses is relevant only if the light path actually passes through 
the sphere. For galaxies of relatively small mass, we may have one or both 
of the deflected rays passing outside the galactic radius R. The deflection 
of light depends only on the Newtonian gravitational potential, and, as 
Newton showed, the gravitational potential outside a spherically symmet- 
ric distribution of matter is just the same as if all the mass were concentrated 
at the center of symmetry, so the motion of a light ray outside the galaxy is 
described by the point mass lens model considered earlier. If the light ray 
passes far outside the effective radius of the galaxy we have so-called weak 
lensing, which is typically discovered statistically, through a correlation in 


2E, L. Turner, J. P. Ostriker, and J. R. Gott, Astrophys. J. 284, 1 (1984). 

3E, L. Turner, Astrophys. J. 242, L135 (1980). 

4E. L. Turner, Astrophys. J. 365, L43 (1990). 

5M. Fukugita and E. L. Turner, Mon. Not. Royal Astron. Soc. 253, 99 (1991). 

6D. Maoz and H-W. Rix, Astrophys. J. 416, 425 (1993) 

TM. Im, R. E. Griffiths, and K. U. Ratnatunga, Astrophys. J. 475, 457 (1997). 

ŠE. E. Falco, C. S. Kochanek, and J. A. Muñoz, Astrophys. J. 494, 47 (1998). 

A.R. Cooray, J. M. Quashnock, and M. C. Miller, Astrophys. J. 511, 562 (1999) [astro-ph/9806080]; 

A. R. Cooray, Astron. Astrophys. 342, 353 (1999) [astro-ph/9811448]. 

10M. Chiba and Y. Yoshii, Astrophys. J. 510, 42 (1999) [astro-ph/9808321]. 
Ny.c.N, Cheng and L. M. Krauss, Astrophys. J. 511, 612 (1999) [astro-ph/98 10392]. 


446 


9.4 Time delay 


the orientation of the images of several lensed galaxies, rather than by the 
study of individual sources. This is the subject of Section 9.5. 


9.4 Time delay 


In addition to light rays being bent by the gravitational fields of intervening 
objects, they are also delayed,! so that a fluctuation in the distant light 
source appears at different times on earth in the several lensed images of the 
source.” There are two effects here. 

First, there is a geometrical time delay, caused by the increased length 
of the total light path from the source to the earth. Since this arises over 
very long distances, it can be calculated by idealizing the light path as a 
geodesic of the Robertson—Walker metric from the source to the point P 
of closest approach to the lens, where the light path is bent, followed by a 
similar geodesic from that point to the earth. The time tg that a light signal 
that leaves the source at time fs arrives at the earth is given by 


E dt 
J —— = OSP + OPE (9.4.1) 
ts a(t) 


where osp and opg are the proper lengths (the integrals of (&jdx!dx/)!/7) 
along the paths from the source to P and from P to the earth, respectively. 
The time tgo that the light would arrive at the earth if there were no gravi- 
tational deflection is given simply by 


Eo dt 
EEN 9.4.2 
[ Fam (9.4.2) 


where ogg is the proper length along the geodesic from the source to the 
earth. Hence the geometric time delay (which is always very short compared 
to a Hubble time) is 


Atgeom = tg — tgo = alte) (osp +opg ose). (9.4.3) 
Now, it is easy to calculate proper lengths along geodesics that end at the 
earth: 
re(S) dr rE(P) dr 
OSE = = Sk. OPES a (9.4.4) 
I V1 — Kr? Í V1 — Kr? 


' The time delay of radar reflections from planets and of radio signals from artificial satellites caused 
by the gravitational potential of the sun has provided a fourth test of general relativity, as first proposed 
and measured by I. Shapiro, Phys. Rev. Lett. 13, 789 (1964). For a discussion, see G&C, Section 8.7. 

25. Refsdal, Mon. Not. Roy. Astron. Soc. 128, 307 (1964). 
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where rg(S) and rg(P) are the usual radial coordinates (in a Robertson— 
Walker coordinate system centered on the earth) of the source S and the 
point P of closest approach of the light to the lens. To calculate osp we 
have to do a little more work. By using Eq. (1.1.17) for the spatial affine 
connection Fy we see that the equation (1.1.26) of a spatial geodesic is 


d*x! 
To +Kx'=0. (9.4.5) 
The general solution is 
l A'coso + B' sino , K=41 
x(o)=}| A +Bo, — K=0 (9.4.6) 
A'cosho + B’ sinho , K=-1, 


where A’ and BÝ are constants characterizing different paths. These con- 
stants are subject to a normalization condition 


a 2 dxi dx! _ (dx 2 K (x -dx/do)* 
=i ee do 1—Kx2 ° 
which gives 

(1 — KA’)(1 — B?) = K(A-B)’. (9.4.7) 
We can determine the constants A and B for the geodesic from the source to 
the point P by requiring that x’(0) = rg(S)ñgs and x' (osp) = re(P)ÎÀEP, 
where figs and figp are the unit vectors from the earth to the source or point 
P, respectively. Imposing the normalization condition (9.4.7) then gives 


OSP 


cos7! [rePyre(S) cos + y1 —re(P)y¥1— reS? | K=+1 
= | \/r2(P) + 12.(S) — 2re(P)re(S) cos K=0 
cosh7! [—re(Pyre(S) cos6 + J/1+repPpyl + reS} | Kaci 
(9.4.8) 


where 6 is the angle between the directions from the earth to the source and 
to the point P. This result applies for a geodesic triangle with arbitrary 
angles, but of course we are interested here in the case where 6 is very small. 
In this case, Eq. (9.4.8) becomes 


re(P)rE(S)6" = 
CES — OEP + Jsin(ops—ER) mais 
= _re(P)re(S)07_ = 
OSP = ) OES — OEP + 2(rE(S)—re(P)) rE(P)) oe 
Gn Bape Pre O K=-1, 


sinh(ogs—ogfp) 
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In the terms proportional to 67 we can ignore the separation of the lens 
from the point P of closest approach, taking rg(P) = rg(L); 0 = B — a; 
and (using Eq. (9.2.14)) ozs — ogp = ors, with errors that would introduce 
terms of higher order in 0. Thus, in all three cases 


re(L)rg(S)0? 


EPER MEDE 9.4.9 
OSP = OES — OEP + (5) ( ) 


The geometric time delay (9.4.3) is then 


_ a(teyre(L)re(S) (B — @)* 
Afgcom = AO l (9.4.10) 


Using Eq. (9.1.2), we can write this in terms of angular diameter distances 
and the redshift of the lens: 
(1 + z2) da(EL)d4(ES) (B — œ)? 
Atgeom = : 
2d4 (LS) 


(9.4.11) 


The details of the lens enter here only in the lens equation, which for each 
image gives the unobservable angle a in terms of the observable angle £. 

There is also a potential time delay, caused directly by the motion of 
the light through the gravitational potential of the lens. The calculation 
of these time differences is generally done on a case-by-case basis, using 
detailed models of the lensing galaxy rather than either the point source or 
isothermal sphere models. Here we will consider only the case of a general 
spherically symmetric lens. In a coordinate system centered on the lens with 
line element in the ‘standard’ form 


dt? = B(r)dt* — A(r)dr? — r°d0? — r° sin? bdo? , (9.4.12) 


the time required for light to travel from a large coordinate distance r to a 
distance b of closest approach to the lens and then out again to r is? 


1/2 
‘i A(r)/B(r) 


2t(r, b) = 2f 
» \1-6/n?(BO)/BO)) 


For the weak gravitational fields that concern us here, we can use the post- 
Newtonian approximation,* which gives 


Ar) =1+2r¢'(r) , Bir) =1+26(7), (9.4.14) 


dr. (9.4.13) 


3G&C, Eq. (8.7.2) 
4G&C, Eqs. (9.1.57) and (9.1.60). It is necessary to redefine the radial coordinate in order to put the 
line element given by the post-Newtonian approximation in the form (9.4.12). 
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where ¢ (r) is the Newtonian gravitational potential. Then to first order in o 
-1/2 


: 2 
2t(r,b) = 2y r2? — b2 + 2 | ( S (7) (wo — 600) dr 
b 


r WAA ee 
+2 f (: = (7) (7) (om - #(0)) dr. (9.4.15) 
b r r 


We will be interested in this in the case r —> oo, because the effects of the 
finite distance between the lens and the source and earth are already included 
in the geometric time delay. The second integral in Eq. (9.4.15) converges 
for r > œ, but the first integral diverges. Noting that for any kind of lens 
of mass M, the Newtonian potential at large distances goes as —MG/r, we 
can put the limit of Eq. (9.4.15) for r > oo in the form 


2t(t, b) > 2r+2MGIn(2r) + f(b), (9.4.16) 


where 
1/2 


o0 2N T 
f(b) = -2MGInb+2 f ( z É) ) (wo ae jue P 
[00] b 2 —3/2 b 2 
vf (1- (7) (7) (00-00) dr. (9.4.17) 


The first two terms in Eq. (9.4.16) diverge for r — ov, but they are 
independent of b, so when we calculate the time difference between the 
arrival of fluctuations in different images of the source we need only to take 
account of the differences in f (b) for various values of b. But this would 
give the delay in the time used in the metric (9.4.12), which is the time told 
by clocks that are far enough from the lens to ignore the lens’s gravitational 
potential, but close enough to ignore cosmological effects. Because of the 
cosmological redshift, the time delay observed on earth is lengthened by a 
factor 1 + zz, so the potential time delay is 


Atpot = (1 + zz)f (b) . (9.4.18) 


For instance, if the lens is a point mass then ¢ = —MG/r, so the first 
integral in Eq. (9.4.17) vanishes and the second integral is independent of 
b, so the gravitational potential of the lens produces a difference in time 
between the arrival of fluctuations in images of the source at angles 6; and 
b2 to the image of the lens, given by 


Atpot(B1) — Atpot(B2) = 2MG(1 + zz) In(b2/b1) 
= 2MG(1 + zz) In(62/B}). 
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Evidently, the potential time delay will generally be of order MG. As 
shown for instance in Eq. (9.1.5), if d4(EL), d4(ES), and d4(LS) are all 
of the same order d, then for strong lensing the squared angle (6 — œ)? is 
generally of order MG/d, so the geometric time delay (9.4.11) is also of 
order MG. For a galaxy of 10!! solar masses this is 5.7 days. Measurement 
of this delay can be used to measure the mass of the lens, without needing 
to know the Hubble constant. We have seen that the measurement of the 
angles between the source image and the lens can tell us Hp MG, while the 
measurement of these angles and time delays can tell us MG, so the com- 
bination of angular and time-delay measurements can yield a value of Ho. 
Here is a list of several lenses that have been used in this way: 


* QSO 0957+561: Two images separated by 6.1”, zg = 1.41, zs = 0.36, 
with time difference 417 + 3 days (95%), gave Ho = 64 + 13 km sec™! 


Mpc7!,> subsequently recalculated® as 77*5) km sec—! Mpc7!. 


e 


B 0218+357: Two images separated by 0.335” and an Einstein ring, 
Zs = 0.96, zz = 0.68, time difference 10.5+0.4 days, gave Ho = 69773 


km sec! Mpe~!.? 


e 


PKS 1830-211: Two images separated by 1.0”, plus an Einstein ring, 
zL = 0.89, time difference 26+4 days (8.6 GHz). Source redshift 


measured as 2.507+.002, gave Hy = 651 km sec™! Mpc”! for Qm = 
1, 2a = 0, Ho = 164 km sec~! Mpc™! for Qm = 0.3, Qa = 0.7.2 


e 


B 1608+656: four lensed images, with three time differences from four 
to ten weeks, gave Hy = 75 km sec~! Mpc~! + 10%.!° 


A table of time delay measurements up to 2003 is given in the review article 
of Kochanek and Schecter.'! In 2006, a survey!” of time delays in ten 
gravitational lenses (under the assumptions that Qy = 0.3 and Qa = 0.7, 
and with a number of parameters for each lens found by requiring that all 
give the same Hubble constant) gave Ho = 1505 km sec~! Mpc7!. 


5T. Kundic et al., Astrophys. J. 482, 75 (1997) [astro-ph/9610162]. 
6G. Bernstein and P. Fischer, Astron. J. 118, 14 (1999). 
TA. D. Biggs et al., Mon. Not. Roy. Astron. Soc. 304, 349 (1999) [astro-ph/9811282. 
8J, E. J. Lovell et al., Astrophys. J. 508, L51 (1998) [astro-ph/9809301] 
°C. Lidman et al., Astrophys. J. 514, L57 (1999) [astro-ph/9902317]. 
10L, V. E, Koopmans, T. Treu, C. D. Fassnacht, R. D. Blandford, and G. Surpi, Astrophys. J. 599, 70 
(2003). 
11C, S. Kochanek and P. L. Schecter, Carnegie Observatories Astrophysics Series, Vol. 2, ed. W.L. 
Freedman (Cambridge University Press, 2003) [astro-ph/0306040]: Table 1.1. 
12P, Saha, J. Coles, A. V. Maccio’, and L. L. R. Williams, Astrophys. J. 660, L17 (2006) [astro- 
ph/0607240]. 
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9.5 Weak lensing 


For cosmology, the most promising application of gravitational lenses 
probably lies in surveys of weak lensing, the study of the distortion of 
the images of distant galaxies by numerous small deflections of light as 
it passes from the galaxies to us through a slightly inhomogeneous distri- 
bution of matter. Consider a ray of light from a point source on a distant 
galaxy at co-moving coordinates rg, where ñ is a unit vector. Whether 
the intervening lenses are galaxies or clusters of galaxies or concentra- 
tions of intergalactic matter, we can think of the total deflection of the 
light ray as the sum of deflections caused by encounters with point lenses 
L of mass mz, at positions xy. (Since we will be considering only terms 
of first order in the total deflection, we can think of the lenses as indi- 
vidual particles, even if they are aggregated into extended objects.) For 
weak lensing, the angle wz between the directions to the source and the 
lens is much larger than the “Einstein ring” parameter Ber for lens L, so 
as we have seen there is only a single image, given by taking the + sign in 
Eq. (9.1.6), which with a < Bg gives the amount of the deflection caused 
by the lens as B47, —ay = Br /æaL. We decompose the co-moving lens 
coordinate vector xz into components parallel and perpendicular to the 
light ray: 


XL =TLÂ+YL, rL=ħ-XL, Yr =x_—n(n-xz). (9.5.1) 


The lens L deflects the light ray in a direction —jz, so the total deflection 
caused by all the lenses is 


A x ~ 4M,Gd,(Ls) 

2 LGd4 
An = — ` ÝLÊEL/QL = — ` I TES ED eo (9.5.2) 
L L 


where for any points P and Q, d4(PQ) is the angular diameter distance 
of Q as seen from P. According to the definition of the angular diameter 
distance d4 (EL) of the lens as seen from the earth, the angle between the 
directions to the source and the lens is «z = azlyz|/d4(EL), where az is 
the Robertson—Walker scale factor at the time that the light passes the lens 
L. Also, it is convenient to use Eq. (9.1.2) to write d4(LS) = a(ts)r(r, rs) 
and d4(ES) = a(ts)rs, where ts is the time the light leaves the source, 
and r(rz,rgs) and rs are the radial coordinates of the source in Robertson— 
Walker coordinate systems centered on the lens and the earth, respectively. 
According to Eq. (9.2.15), 


r(rL,rs) = rsy1 — Kr? — rill — Kr : (9.5.3) 
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The deflection is then 


4M 
AD y 4MLG r(TL, rs) (9.5.4) 


L 
2 
T rsaL\yL| 


This is not in itself a useful result, because we do not generally know 
where the image of a point on the source would be if there were no lenses. It 
is more interesting to consider the variation of the deflection with position 
of the ray origin. Suppose we consider a small change 0 in the undeflected 
direction 7 to the ray origin, with 0 perpendicular to 7 and |0] « 1. The 
change in the vector yz from the light ray to lens L at the point of closest 
approach due to the displacement of the source is then 


yL = —O A- XL) — ÂO xL) 


Dropping the term proportional to ñ, we see that to first order the change 
in the deflection of the image normal to the line of sight is 


Aba = ) Malts, Â) 9 . (9.5.5) 
a 


with a and b running over two orthogonal directions normal to 7, and 
Mab(rs, ñ) the 2 x 2 shear matrix for images of a source at a distance rs: 


` 4MLGr(rL,rs)rL ( ab  2YLaYLb 
Mats: =Y | -= {= (9.5.6) 
T rsaL lyz| lyz| 
It is conventional to write this matrix as 
M= ( EPG S ) , (9.5.7) 
y2 Kk — yı 


with « called the convergence and y; known as the shear field. 
We may conveniently rewrite the shear matrix in terms of the perturb- 
ation to the total Newtonian potential due to the lenses:! 


MLG MLG 
yens ey 1/2 
fe eee T ay ((r—ri(o? + O- yL?) 
(9.5.8) 


!We can neglect the effect of lenses at a cosmological distance from the light ray, so the denominator, 
including the factor az, is the proper distance to the lens L. We are here representing the complete 
perturbation to the mass density as a set of lensing point masses, but since these may be particles as well 
as galaxies, this introduces no loss of generality. 
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where x = rñ + y withn-y = 0. (We are here explicitly taking into account 
the fact that the lenses may have a time dependence, as for instance from 
peculiar motion or cosmological evolution. In Eq. (9.5.6) we are using the 
abbreviations 


rL =rv(tr,), YL = Ji Ui) 


where f, is the time that a light wave that reaches us at the present moment 
was at a radial coordinate r, given by r = Hi dt/a(t).) In particular, the 
second transverse derivatives of the potential on the light ray are 


32 i MG ôa 
; 5 Sp (rù +y, J = y = b = 
TE yo T % \ (=r)? + OP) 


E 3Y La HYLLE) 
5/2 
(© -rL + yz?) 


$ 


; —3/2 —5/2 
The functions (=r +yz) and (C-ri)? +yz) are sharply 
peaked at r = rz, so the integral of this second derivative times any smooth 
function f(r) (smooth, in the sense that it varies little when r varies by an 
amount of order |yz]) is 


2 
[ro dr l 4 ôp(rů+y, J 


dyad Yb y=0, (=, 
= pp aaa (= Siaa) 
T aL lyzl? lyzl J 


Comparing this with Eq. (9.5.6), we see that the shear matrix M,, for a 
source at radial coordinate rg is 


TS r(r,rs)r a? 
rs 3Yaðyp 


Ma (rs, ñ) = af ôp(rů+y, J dr. (9.5.9) 
0 


y=0, t=t, 


So measurement of the shear matrix can yield information about pertur- 
bations to the gravitational potential by masses al a the line of sight. In 
particular, by contracting the indices a and b, we see that 


1 rs ; 32 
«= tM = | AE V? — — | do(ra+y,t) dr. 
2 0 rs or 


y=0, t=1, 
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If the lensing is due to a collection of bodies (such as a cluster of galaxies), 
all at about the same radial coordinate rz, then d¢@ falls off rapidly for large 
values of |r —rz|, so that the factor r (r, rs)r can be replaced with r(rz,rs)rz, 
and the integral of 9°5@/d7r can then be set equal to zero. Using the Poisson 
equation a~7V76@ = 4G Spy, we can write the resulting expression in 
terms of the density fluctuation doy: 


_ 4n Ga (t, )d4(LS)d4(EL) 
_ d4(ES) 


the integral being taken over a range of r passing through the lens. Hence, 
assuming that we know the angular diameter distances appearing in 
Eq. (9.5.10), a measurement of the value of « for sources seen in some direc- 
tion can tell us the total mass density of a cluster of lensing masses that lie 
along that line of sight, projected onto the plane normal to the line of sight. 

Now let’s consider how the lensing of the images of galaxies can be 
used to measure the shear matrix. In order to deal with galaxies that are 
irregularly shaped, it is convenient to describe the shape of a galaxy by a 
quadrupole matrix: 


dpm (rn, t,, )a(ty, ar , (9.5.10) 


_ [PO LO), 

Qab = "F P6 LO) 
Here the integral is over the transverse displacement 0 of the direction of 
points on the image of the galaxy from some central point; the indices a, b, 
etc. run over two orthogonal directions in the plane of this image; and £ 
is the surface brightness—the apparent luminosity per solid angle—of the 
image. (In order for the integral in the numerator to converge, it may be 
necessary to replace £L by some function of £, such as one that equals £ 
when £ is above some threshold brightness, and otherwise vanishes. This 
has no effect on the following analysis.) As remarked in Section 9.2, the 
surface brightness of any point is the same as would be seen in the absence 
of lensing from the same point on the source, so 


(9.5.11) 


c(a + M)6) = c (e) , (9.5.12) 
where £“ is the surface brightness of the source in the absence of lensing. 


Introducing a new variable of integration 6’ in Eq. (9.5.11) by writing 6, = 
(1 + M)qp6;, and using Eq. (9.5.12), we have 


J d’O L(0)0a0p = (1 + M)ac(1 + M)pa Det (1 + M) J d*6' £L5(0')0/0', , 
and likewise 


J d*6 L0) = Det(1 + M) J d’o' £L°(6') , 
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The determinant cancels in the ratio, so (dropping primes) we have 
Qab = (1+ M)ac(1 + M)ba Qia , (9.5.13) 
where Q* is the quadrupole matrix in the absence of lensing 


, [PO £50) 6.6 
Qa = TPE EO) 


(9.5.14) 


For simplicity of presentation, we will limit ourselves here to the case of 
lensing that is sufficiently weak so that all elements of M have |Ma « 1, 
the case of greatest interest in cosmology. In this case (9.5.13) becomes 


Qab = Qp + MacQ?p + Mab Qia - (9.5.15) 


Of course, we would not know the unlensed quadrupole matrix for any 
particular galaxy. But if we have a sample of galaxies in about the same 
direction and with about the same redshift, so that the shear matrix is the 
same for all the galaxies in this sample, then we can learn something about 
this shear matrix by making the reasonable assumption that the orientations 
of the galaxies are uncorrelated. There are at least two ways that this can 
be done. 


Standard method? 
In the standard method, one considers the ellipticity matrices 
Xab = Ow/TrO , X*, = O,/TrQ’ , (9.5.16) 


which are normalized to have unit trace. To first order in M, Eq. (9.5.15) 
gives 
Xab = Xip + MacX2g + MpaX iy — 2X iTr ( MX") . (9.5.17) 
If the orientation of galaxies is random, then the average of Xj, over a 
sufficiently large sample will be proportional to 6,,, and since these matrices 
are defined to have unit trace, it follows that the coefficient of proportionality 
must be just 1/2: 
1 
(Xi) = 7 Sab ; (9.5.18) 
To deal with the term in Eq. (9.5.17) that is quadratic in X*, we also need the 
average of X’, X*,. The random orientation of the galaxies and the symme- 
try of the product requires this average to be a linear combination of 6,,6¢¢ 


2See, e.g., M. Bartelmann and P. Schneider, Phys. Rep. 340, 291 (2001). 
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and Sacdpq + Sdad5pe, and since X* has unit trace, this linear combination 
must take the form 


1 
(Xip Xi) = 7 [SubSea + E (Sacdsa + Budde — Savea )| > (9.5.19) 
where, contracting a, b with c, d in Eq. (9.5.19), 


p= (T(x s2) —1/2. (9.5.20) 


(For instance, if the galaxies in our sample are all spheres, then X* has both 
eigenvalues equal to 1/2, and £ = 0, while if they are all extremely prolate 
ellipsoids then one eigenvalue is unity and the other is zero, and £ = 1/2. 
More generally, 0 < € < 1/2.) Taking the average of Eq. (9.5.17) and using 
Eqs. (9.5.18) and (9.5.19), we have 


(Xp = Laus +0 -£) (ma = 55uTrM) (9.5.21) 
We do not observe the average of Tr(X*) that appears in our formula 
(9.5.20) for £, but since £ appears only in (9.5.21) multiplying a term that is 
already of first order in M, in calculating (X,,) to first order in M we only 
need € to zeroth order, and to this order we can replace X* in Eq. (9.5.20) 
with the observed ellipticity matrix X: 


g =(Tr(x’))- 1/2. (9.5.22) 


This can be measured from the observed shapes of the galaxies in our sample, 
and Eq. (9.5.21) can then be used to find the traceless part of the shear matrix 
from a measurement of the average of the observed ellipticity matrix X. 

A word on formalism: In the literature, in place of the real ellipticity 
matrix, one often encounters a complex ellipticity parameter 


X = Ail = Xn + 2X (9.5.23) 


and in place of Eq. (9.5.21) one finds a formula relating the complex shear 
parameter y = yı + iy) to averages of functions of the ellipticity parameter? 


(x) = 2y — 2(x Re(x*y)) . (9.5.24) 
Using Eq. (9.5.19), it is easy to see that the average on the right is 
(xRe(x"y)) = £y , (9.5.25) 


3P Schneider and C. Seitz, Astron. Astrophys. 294, 411 (1995). This is Eq. (4.16) of ref. 2, in the 
limit |y;| « l and |x| « 1. 
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and that therefore the relation (9.5.24) between x and y reads 


(ou — Qn + 2iQ12 
Q11 + Q2 


which is the same as Eq. (9.5.21). 


=21-&)(1 +iy2), (9.5.26) 


Alternative method 

It is possible to avoid the need to measure the parameter £ for a sample 
of source galaxies, by considering averages of the quadrupole matrix Qab 
itself, rather than the matrix X,, normalized to have unit trace. The random 
orientation of galaxies in our sample of sources tells us that the average of 
unlensed quadrupole matrices takes the form 


_ 
(Qi) = 7 Qôab » (9.5.27) 


with Q an unknown positive constant. Eq. (9.5.15) then gives 


-i 
(Qab) = Q Fe + Mas] . (9.5.28) 


We can eliminate the unknown Ọ by dividing by the average of the trace of 
the quadrupole moment. Eq. (9.5.28) and its trace give, to first order in M, 


(Qa) 1 1 
ae = 5ôab + Mab — 58as TtM (9.5.29) 
That is, 
= 2i 
mei (Q11 — Qn + 2iQ12) (9.5.30) 


2(Q11 + Q22) 
Thus we can calculate the traceless part of Map — that is, the shear field 
— from a measurement of the average quadrupole tensor, with no need to 
make a separate measurement of a parameter like £. 

The difference between the standard and alternative methods is not 
just one of formalism, for they involve different kinds of averages. In the 
alternative method galaxies are weighted proportionally to the area of their 
images, while in the standard method all galaxies are weighted equally. Thus 
in the alternative method one does not have to worry so much about missing 
galaxies of small apparent area. 

Frequently it is not practical to measure the redshifts of the individual 
galaxies whose images are distorted by weak lensing. In such cases we must 
assume that there is some probability distribution V (rs) for these galaxies to 
be at a radial coordinate rs, with dor N (rg) drs = 1. Using the alternative 
method described above, we can still use Eq. (9.5.29) to calculate the traceless 
part of the shear matrix from the average observed quadrupole matrix, but 
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now the average of the quadrupole matrix is calculated using all galaxies 


along a given line of sight, and the shear matrix is replaced with an effective 
shear matrix 


Mani) = | NES Mantes. fd drs 


lee) r 2 
=) | N (rs) drs / PITS l O Ey, J dr 
0 0 


rs 3YaðYb E 
y=0 
oo 32 
= Í pr +y, tr) g(r)dr, (9.5.31) 
0 | days s 
where = 
g(r) =2 i ORUS drs. (9.5.32) 
r S 


Similarly, using the standard method we can still calculate the traceless part 
of the effective shear matrix from the mean ellipticity matrix, provided there 
is no evolution in the shape of galaxies. In this case, Eq. (9.5.21) still applies, 
but with the shear matrix given by Eq. (9.5.31), and with the average squared 
ellipticity matrix in our formula (9.5.22) for the € parameter calculated by 
averaging over all galaxies along the line of sight. But if galaxy shapes 
evolve, then Eq. (9.5.31) must be modified accordingly: Instead of a factor 
1 — é multiplying the shear matrix in Eq. (9.5.21), it is necessary to use 
Eq. (9.5.22) to calculate a function é(rs) from the average of the squared 
ellipticity matrix of galaxies at a radial coordinate rs, and include a factor 
1 — &(rg) in the integrand of Eq. (9.5.31). This complication does not occur 
with the alternative method. 

Whichever method we use to extract the shear matrix from 
measurements of weak lensing, we now must face a problem. We would 
like to use weak lensing measurements to learn about the density pertur- 
bation do, but as shown in Eq. (9.5.10), it is only the convergence « that is 
related in any simple way to the density perturbation, while Eqs. (9.5.21) 
and (9.5.29) show that measurements of weak lensing tell us about the shear 
field y;, not x. Fortunately, although there is no simple relation between y; 
and x, there is a simple relation among their gradients.4 From Eq. (9.5.9), 
we have 


a rs ; 33 X 
F Matsi =2 f 228? BPA +y, t) dr. 
ON¢ 0 rs IVaIVHIVe 


y=0, t=t; 
(9.5.33) 


4N. Kaiser, Astrophys. J. 439, L1 (1995). 
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Similarly, from Eq. (9.5.31), we have 
a si a 
— Map = Í ——— So(rn + y, t;) gtr) dr (9.5.34) 
dlc 0 IVaIVHIVe y=0 r 
The important point is that both expressions are completely symmetric 


among a, b, and c. Thus the divergence of the traceless part of the shear 
matrix is proportional to the gradient of the trace: 


a 1 1a 
a (Mu Z 55«sTeM ) = TrM , (9.5.35) 


dita 2 2 diy 
where M can be either M (rs, ñ) or M (à). In other words, 


ð a a ð ð 0 
Se eee (9.5.36) 
ont on, = Ot 3M ð) = ON 


Thus « can be calculated from the shear field y;, up to an 7-independent con- 
stant. As mentioned earlier, the values of x obtained in this way can be used 
to measure the projected mass density of lensing bodies along the line of 
sight. Starting in the late 1990s, numerous groups using telescopes of mod- 
erate size have detected shear due to weak lensing. These measurements, 
together with the above analysis, have been used to map out the distribution 
of all matter, dark as well as baryonic, in various clusters of galaxies, such 
as the bullet cluster 1E0657-558 described in Section 3.4. 

Now let us turn to the application of weak lensing to find the distribution 
of inhomogeneities in the cosmological mass density. In using weak lensing 
to study the large scale distribution of matter in the universe, we are not 
so much interested in the shear matrix in any one direction, as in its dis- 
tribution around the sky. Fluctuations in the gravitational potential have 
random sign, so the average over the sky of the shear matrix is zero. It is 
most useful to consider instead the average of the product of shear matrix 
elements along two different lines of sight, with “average” understood in the 
same sense as used in the analysis of fluctuations in the cosmic microwave 
background: either an average over positions from which the sky might be 
observed, or an average over the particular series of accidents that lead to 
a particular distribution of cosmic matter. Neither average corresponds to 
what we observe, but the difference vanishes for large multipole orders, for 


5For references to these observations and a brief general review, see D. Munshi and P. Valageas, 
Roy. Soc. London Trans. Ser. A 363, 2675 (2001) [astro-ph/0509216]. Comprehensive reviews of weak 
lensing are given by P. Schneider and M. Bartelmann, ref. 2; A. Refrigier, Ann. Rev. Astron. Astrophys. 
41, 645 (2003); P. Schneider, in Gravitational Lensing: Strong, Weak, and Micro, eds. G. Meylan et al. 
(Springer-Verlag, Berlin, 2006): 269 [astro-ph/0509252]. 
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the same reason as explained in Section 2.6 for the multipole coefficients of 
anisotropies in the cosmic microwave background. 

Since we are now considering different directions, it is convenient to write 
the definition (9.5.7) of the convergence and shear fields more explicitly. If 
we write ñ in terms of polar and azimuthal angles as 


n= (sin V cosg, sind sing, cos °), (9.5.37) 
and take the two orthogonal directions normal to ñ as 


Ê = (cos Y cosg, cos Y sing,—sind), ¢=(—sing,cosy) (9.5.38) 


then 
K(f) +y Â) = db; My , (9.5.39) 
kâ) — vin) = gid; MyM) , (9.5.40) 
nÂ) = 319; My) . (9.5.41) 


We are here using Ŷ and g instead of the usual 6 and ¢ for the polar and 
azimuthal angles, to avoid confusion with other uses of the symbols 6 and 
in this section. Also, since we will now be considering different directions ñ, 
we have promoted the two-valued indices a, b, etc. to three-valued indices 
i, j, etc. Eq. (9.5.31) for the 2 x 2 shear matrix Ma» has accordingly been 
extended to give a 3 x 3 matrix 


lee) 2 
Mj (”) = [ EZ J g(r)dr. (9.5.42) 
x=rn, t=t; 


OXjOX; 
Equivalently, we can write 
y (a) = nA + in A) = epi) e4; (2) My”) , (9.5.43) 


where, as in Section 7.4, e} = (Ô + i¢)/V/2. Also, since 0 and ¢ span the 
space normal to À, 


À 1 nad A 
Ki) = 5 (5; — ñij) My (A) . (9.5.44) 
The expansion of the convergence in spherical harmonics follows along 
almost the same lines as the expansion of microwave background tempera- 


ture fluctuations described in Sections 2.6 and 7.2. We write the perturbation 
in the gravitational potential as 


50(x,t) = J d>q a(q)ddbq(te* , (9.5.45) 
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where a (q) is the stochastic parameter for scalar perturbations, normalized 
so that 

(a(q)a*(q’)) = 8q- q’) . (9.5.46) 
Then Eqs.(9.5.42), (9.5.44), and (9.5.45) give 


fi : g A iq-Îr 
KA) = —5 J dqala) [ g(r) Shatt) (a? — (a?) ear 


The quantity — (q - â)? can be replaced with 3?/3r?, and for the exponential 
we can use the familiar formula 


elt = An X it jetar) Yi") YO) . (9.5.47) 
£m 
This gives the partial wave expansion for the convergence 
Cm Y dea Y O (9.5.48) 
£m 


with coefficients 
ak tm = —2r i" J dq e aA Y7 A 


x j g(r) ddbg(tr) (ic (qr) +i/(@")) dr. (9.5.49) 


This can be used together with Eq. (9.5.46) to calculate the correlation of the 
convergence with itself, or with microwave background temperature fluctu- 
ations or any other scalar perturbations. In particular, for the correlation 
of the convergence with itself, we have 


(dem ae em!) = gemm KK,L > (9.5.50) 


where C,«,¢ is the multipole coefficient 


2 
. (9.5.51) 


Cyne = 4n? Í qf dq | [ g(r) ddbg(ty) (jar) +a) dr 


On the other hand, comparing Eqs. (9.5.43) and (7.4.6), we see that 
the shear components yı (ñ) and y2 (ñ) involve the polarization e} (ñ) in 
much the same way as the Stokes parameters O(n) and U (ñ), so for the 
same reason as in Section 7.4, it is necessary to expand the complex shear 
parameter yı (ñ) + iy2 (ñ) in the spin 2 spherical harmonics V; f), defined 
by Eq. (7.4.16): 


(€ — 2)! 


(€+2)! e+ i) ey Vi V; YA), (9.5.52) 


V'A = 2 
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where V is the angular part of the gradient operator: 


~ 149 Qo a 
V=0 i 9.5.53 
av = sin 0 dg ( ) 
Using Eqs. (9.5.42), (9.5.43), (9.5.45), and (9.5.47), we have 
via) Hin) = Y ay emV Â) (9.5.54) 
£m 


with coefficients 


: +2)! 
= 2)! 


[ea alq) Y7 0)" K Spalt jq g(r) r? dr. 


(9.5.55) 


The reality of 6@(x, t) requires that a*(q)d¢, (t) = a(—q)d¢,(t), while 
Y;"*(q) = (- Ley, ™(—G), SO ayem = ay, t-m That is, the shear pro- 
asd by perlahankan | in the mass density is of “E” rather than “B” type, 
in the sense of Eq. (7.4.25). Thus any observation of B-type shear, for which 
ay tm = =a}; _m Would be a sign of lensing caused by something other 
than density perturbations, such as gravitational waves. Unfortunately, the 
lensing due to gravitational waves produced in inflation is much too small 
to be observed.° 

Using (9.5.46) and the orthonormality of the spin 2 spherical harmonics, 


(Ay ,em ay em) = ee bmn! Cyy.e > (9.5.56) 


where the multipole coefficients in shear-shear correlations are 


£ 
dy m = 20 


4n?( + 2)! 
P eee as . (9.5.57 
ae Io) pee 


oo lee) 2 
[ alf pq (tr) je(qr) g0) r? dr 


The Poisson equation V75¢ = 42Ga7dpy relates the quantity bq (t) 
appearing in Eqs. (9.5.51) and (9.5.57) the power spectral function P(q/ao) 
of fractional density fluctuations introduced in Section 8.1: 


qf OD = (42 GPE OEY (Ob O 


= (4r GPa (OP AP (q/a0) ( z ae a 


905, Hot (t) 
= aD (Fy) 0858) 


with F(t) the function (8.1.20), and ag = a(fo). 


6s. Dodelson, E. Rozo, and A. Stebbins, Phys. Rev. Lett. 91, 021301 (2003). 
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It is possible to give less complicated formulas for the multipole 
coefficients in the most interesting case, of £ >> 1. Let us first consider 
the xx correlation. We can evaluate the integral over r in Eq. (9.5.51) by 
recalling that for £ > 1, if gr differs from £ + 1/2 by more than an amount 
of order €~!/3, the spherical Bessel function je(qr) is exponentially small 
for gr < £+ 1/2 and rapidly oscillating for gr > £ + 1/2, while (unlike the 
source functions in microwave background anisotropies) all other ingredi- 
ents in the integral vary slowly with r. For £ > 1, we can therefore replace 
r with £/q in g(r) and ¢;: 


2 


> 


Cre > 4n? [ dq q°\3bq(tejq)|” 8/9) | Í (ie(ar) +N) dr 
(9.5.59) 


The integral of je (qr) gives’ 


2 1 
Í je(qr) dr = s- = = ; = l (9.5.60) 


On the other hand, since for large £ the contributions from r « £/q and 
r > £/q are strongly suppressed, we can drop the integral over r of jy (qr). 
Thus Eq. (9.5.58) gives, for £ > 1, 


T ta 4 2 2 
Cups Í dq 4'lôha lte LEA) 


ane! | dr 15 (óe (t|? 20) 


_ 925, Hja [ ea, 
16 0 a? (t)r? 


F(t) 
F (to) 


The large £ limit of the yy multipole coefficient is precisely the same. In 
this limit, the factor (£ + 2)!/(€ — 2)! in Eq. (9.5.57) becomes £4, and the 
factor r7? in the integral over r in Eq. (9.5.57) can be replaced with (q/€)*, 
so for large £ the ratio of the integrand of the g-integral in Eq. (9.5.57) to 
that in Eq. (9.5.51) (dropping jy (qr)) is 


2 
Pean ( ) . (9.5.61) 


ég aot _ 


l 
qf 


7This is a special case of formula 6.561.14 of I. S. Gradshteyn and I. M. Ryzhik, Table of Integrals, 
Series, and Products, ed. A. Jeffrey (Academic Press, New York, 1980). 
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Hence for large £, 


992, Haas (9 g(r) F(t,)\? 
lee ae Tera E pay ac) 


(9.5.62) 


Thus we get just the same information by measuring the shear multipole 
coefficients at large £ as we would get if we could measure the convergence 
multipole coefficients at the same £. 

For a given weighting function g(r) defined by Eq. (9.5.32), observation 
of the £ dependence of Cyy,e (or Cke,e) provides a measurement of the 
k dependence of the power spectral function P(k). It should be noted 
that if the integral receives its main contribution from radial coordinates 
r such that vacuum energy has not yet become important by the times 
t,, then according to Eq. (8.1.20), F(t) « a(t), so Eq. (9.5.62) simplifies 
further, to 


92° ution g(r) 
Cyye > Cyne > Sue fe dr r =y P(e/aor) « (9.5.63) 


This is the formula usually quoted.® 

We can express the correlation functions for the shear components and 
microwave background temperature fluctuations by taking over results we 
have already found for the correlation functions of microwave background 
polarization and temperature fluctuations. Comparing Eq. (9.5.54) with 
Eqs. (7.4.15) and (7.4.25), and comparing Eq. (9.5.56) with Eq. (7.4.28), 
we see that we can obtain the correlation functions for the shear compo- 
nents from Eqs. (7.4.76)-(7.4.78) by replacing Q and U with yı and y2 and 
replacing Cgg e with C,,,¢ and (since shear is purely of E-type) dropping 
CBB: 


1 

NANEN = IL Ce Re( Fe, A+A), (9.5.64) 
1 

(ra) 9) = 5 Cove Re( Feil) — GÂ, âN), (9.5.65) 


1 
MAAD = 5 CyyeIm( — Fel) + Ge, A!) , (9.5.66) 
£ 


8D, J. Bacon, A. R. Refrigier and R. S. Ellis, Mon. Not. Roy. Astron. Soc. 318, 625 (2000). Correl- 
ation functions for elements of the shear matrix were studied by N. Kaiser, Astrophys. J. 498, 26 
(1998). 


465 


9 Gravitational Lenses 


where F; and Gy are functions given by Eqs. (7.4.87) and (7.4.88): 


Fâ i = “** ® (SIASA), (9.5.67) 


Game po (STAS), (9.5.68) 


with D® the irreducible unitary representation of the rotation group for 
angular momentum £, and S(/) the standard rotation (7.4.12) that takes 
the three-direction with 3 = g = 0 into the direction nv. Also, if we write 
the microwave background temperature fluctuation as in Eq. (2.6.1) 


AT(A) =J aremY?'@, 
£m 


and use rotational invariance to define 
(AT em d; em) = pe Ômm Cry > (9.5.69) 


then we can obtain the correlation functions for the shear components 
with microwave background temperature fluctuations from Eqs. (7.4.81) 
and (7.4.82) by again replacing Q and U with yı and y2 and replacing Cre ¢ 
with —C Tye: 


(ATA vA) = Y Cry e ReH Â, Â’) , (9.5.70) 
£ 


ATA Â) = — Y Cry e IMH Â, À’) , (9.5.71) 
£ 


where H; is the function (7.4.89): 


Hi AN = = Dy a(s! (asia ») l (9.5.72) 


For comparison, we mention also that the convergence correlation function 
is given by the analog of the temperature correlation function (2.6.4): 


2£+1 
(DEAD) = Cece Pe N) (9.5.73) 
£ 


The measurement of Cy, ¢ or (with the aid of Eq. (9.5.36) or (9.5.62)) of 


Cxx,e can be used not only to learn about the power spectral function P(k), 
but also to put constraints on the cosmological parameters that enter in 
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Eq. (9.5.32) for g(r). So far, shear measurements from the Canada—France— 
Hawaii Telescope Wide Synoptic Legacy Survey have been used” to set a 
value for the parameter og discussed in Section 8.1: under the assumption 
that Qy = 0.3, it is found that og = 0.85 + 0.06. Although this param- 
eter is also measured in studies of large scale structure, the weak lensing 
measurement has the advantage of not depending on the use of luminous 
sources as a tracer of dark matter. The same group also finds from cos- 
mic shear data alone that the ratio w = p/p, if assumed constant, is less 
than —0.8 at 68% confidence. Another group,!? combining shear measure- 
ments from the CTIO lensing survey with cosmic microwave background 
data and measurements of Type Ia supernova redshift and luminosities, has 
found that og = 0.81759, and that (if constant) w = —0.8979'5?, with 95% 
confidence. The application of weak lensing surveys to cosmology has really 
just begun. 


9.6 Cosmic strings 


The spontaneous breakdown of symmetries in the early universe can pro- 
duce linear discontinuities in fields, known as cosmic strings.! Cosmic 
strings are also common in modern string theories.” Unless we are unlucky 
enough to have a cosmic string slice through the solar system, the only way 
that a cosmic string can be discovered seems to be through its action as a 
gravitational lens. 

At a sufficient distance r from a string, the gravitational field becomes 
Newtonian. For a long straight string the solution of Laplace’s equation 
with cylindrical symmetry has a gravitational potential of the form 


= —2Golnr+C, (9.6.1) 


where o and C are constants of integration. If the string is non-relativistic, 
then o is the string’s mass per length. At a sufficient distance from the string 
the post-Newtonian approximation applies, and the direction w of a light 


°H. Hoekstra et al., Astrophys. J. 647, 116 (2006) [astro-ph/051 1089]. 

10M. Jarvis, B. Jain, G. Bernstein and D. Dolney, Astrophys. J. 644, 71 (2006) [astro-ph/0502243]. 

IT W. B. Kibble, J. Phys. A 9, 1387 (1976); A. Vilenkin and E. P. S. Shellard, Cosmic Strings and 
Other Topological Defects (Cambridge University press, Cambridge, 1994). 

2E, Witten, Phys. Lett. B 153, 243 (1985); E. J. Copeland, R. C. Myers, and J. Polchinski, J. High 
Energy Phys. 0406, 013 (2005) [hep-th/0312067]. 

3The relativistic calculation of lensing by a cosmic string is due to J. R. Gott, Astrophys. J. 288, 422 
(1985). The anisotropy in the cosmic microwave background due to cosmic strings was calculated by 
N. Kaiser and A. Stebbins, Nature 310, 391 (1984). 
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ray is governed by the equation* 


a = —2u x (ux Vo). (9.6.2) 


As long as the deflection of the light ray is small, it can be calculated as 
du = —2ùo x (Uo x [ ave) ' (9.6.3) 


where i is the photon’s initial direction. An elementary integration shows 
that the light ray is deflected toward the string in a direction perpendicular 
to both the string and the light ray, by an angle 


= 4roG 
sind 


|5i| 


; (9.6.4) 


where 0 is the angle between the directions of the string and the light ray. 
Because the deflection is toward the string, it is in the opposite direction 
for rays passing the string on opposite sides, so that the image of a source 
behind the string is split into two parts. It is noteworthy that neither the 
direction nor the magnitude of this deflection depends on the distance of 
the light ray from the string, as long as the distance is large enough to allow 
the use of the non-relativistic formulas (9.6.1) and (9.6.2). 

In 2003, general interest in cosmic strings was heightened by the dis- 
covery of what seemed at first to be a plausible candidate for lensing by a 
cosmic string. A pair of images of elliptical galaxies separated by about 1.8 
arcseconds was found to have substantially the same redshift, z = 0.46, and 
the same spectra. The images were not distorted in the way that would be 
expected for lensing of a single galaxy by a more-or-less spherical source, but 
are consistent with lensing by a cosmic string.» Subsequently an excess of 
gravitationally lensed objects was found in the neighborhood of this string 
candidate,° lending further support to the view that this was the image of 
a single galaxy lensed by a cosmic string. But in 2006 this interpretation 
had to be abandoned, when observations at the Hubble Space Telescope 
revealed that this was in fact a pair of interacting elliptical galaxies, not the 
result of any sort of lensing.’ This episode illustrates how difficult it will be 
to detect cosmic strings through their lensing action. 


4See footnote | of Section 9.3. 

5M. V. Sazhin et al., Mon. Not. Roy. Astron. Soc. 343, 353 (2003) [astro-ph/0302547]. 
6M. V. Sazhin et al., astro-ph/0406516. 

TM. V. Sazhin et al, Mon. Not. Roy. Astron. Soc. 376, 1731 (2007) [astro-ph/0611744]. 
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Inflation as the Origin of Cosmological 
Fluctuations 


The most exciting aspect of the inflationary cosmological theories described 
in Chapter 4 is that they provide a natural quantum mechanical mecha- 
nism for the origin of the cosmological fluctuations observed in the cos- 
mic microwave background and in the large scale structure of matter,! and 
that may in the future be observed in gravitational waves.? We have seen 
in Chapter 6 that the magnitude and wavelength dependence of adiabatic 
scalar and tensor fluctuations depend on initial conditions only through 
the quantities R4 and Dj, respectively. As given by Eq. (5.4.24), R4 is the 
value of the gauge-invariant quantity R,(t) = Aq4(t)/2+ H (t)dug(t) outside 
the horizon, that is for g/a(t) « H(t), where for adiabatic fluctuations Rq is 
time-independent. Likewise, D4 is the time-independent value of the grav- 
itational wave amplitude D,(t) for g/a(t) « H(t). During the matter- or 
radiation-dominated eras a(t) increased like 17/3 or t!/?, respectively, while 
H(t) = a(t)/a(t) decreased like 1/t, so any wavelength will be found outside 
the horizon if we go back early enough in one or the other of these eras. 
But during the period of inflation that is supposed to precede the radiation- 
dominated era, H(t) was roughly (perhaps very roughly) constant, while 
a(t) increased more-or-less exponentially, so even if a perturbation was out- 
side the horizon at the end of inflation it would inevitably have been found 
deep inside the horizon sufficiently early in the era of inflation. At these very 
early times fields oscillated much more quickly than the universe expanded, 
and their quantum fluctuations therefore were essentially just what they 
would be in ordinary Minkowski spacetime. In this chapter we will follow 
adiabatic scalar fluctuations and tensor fluctuations from this very early 
era, through the epochs when fluctuations of various wavelengths exited the 
horizon, to the time when R4(t) and D,(t) reached the constant values Ra 
and D7 that are measured in the cosmic microwave background and in the 
large scale structure of matter, and that may some day be measured through 
the direct detection of cosmological gravitational radiation. 


1S, V. Mukhanov and G. V. Chibisov, Sov. Phys. JETP Lett. 33, 532 (1981); S. Hawking, Phys. Lett. 
115B, 295 (1982); A. A. Starobinsky, Phys. Lett. 117B, 175 (1982); A. Guth and S.-Y. Pi, Phys. Rev. 
Lett. 49, 1110 (1982); J. M. Bardeen, P. J. Steinhardt, and M. S. Turner, Phys. Rev. D 28, 679 (1983); 
W. Fischler, B. Ratra, and L. Susskind, Nucl. Phys. B 259, 730 (1985). 

2The cosmological generation of gravitational waves was considered by L. P. Grishchuk, Sov. Phys. 
JETP 40, 409 (1974); A. A. Starobinsky, Sov. Phys. JETP Lett. 30, 682 (1979), and calculated in 
the context of inflation by V. A. Rubakov, M. V. Sazhin, and A. V. Veryaskin, Phys. Lett. 115B, 189 
(1982). 
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10.1 Scalar fluctuations during inflation 


We will first consider the simplest model of inflation, with a single real 
“inflaton” field g(x), and an action (B.63): 


1 
I= fex y —Detg -n Laa ro)| ‘ (10.1.1) 


Ox! ax” 


involving an arbitrary real potential V(g). This is not the only possibility; 
we will take up the question of its plausibility at the end of this section. 

In line with the observed isotropy and homogeneity of the universe on 
the average, we take the scalar field as an unperturbed term g(t) that depends 
only on time, plus a small perturbation dg(x, t): 


p(x, t) = P(t) + ôy (x, t) . (10.1.2) 


Similarly, as in Chapters 5-7, the metric is given by the unperturbed 
Robertson—Walker metric g,,,(¢) (with K = 0) plus a small perturbation 
Auv (x, t) 
Suv Xt) = Zpv(O +h t). (10.1.3) 
The energy momentum tensor of the scalar field is shown in Appendix B 
to take the perfect fluid form, with an energy density, pressure, and velocity 
four-vector given by Eqs. (B.66)-(B.68) as 


l dp dy 
= —- g 4 
p 28 Axil axe t (p) 
1 dp dy 
=i gt =V. 
p=-38 Fn a P 
—1/2 
ias (ees a E, 
Ox? 0x? Oxt 


The energy-density and pressure of the unperturbed scalar field are then 


ae hed E ay ub lee s 
p=5P+V@, P=30-V®, (10.1.4) 
while the unperturbed velocity four-vector has components 
w=0, W=1. (10.1.5) 


The Friedmann equation (with zero spatial curvature) is here 


H’ = ae (5 + ræ) , (10.1.6) 
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and the energy conservation condition p = —3H(( + p) yields the field 
equation of the unperturbed scalar field 


~9+3HG+V'G) =0, (10.1.7) 


where as usual H(t) = a(t)/a(t). 
The perturbation dg,» = hy» to the metric will be taken in the 
Newtonian gauge form (5.3.18): 
hoo =—2¥, hoi =0, hy = 2075; . (10.1.8) 
(The scalar anisotropic inertia term zs, as well as x” and x7, vanishes 
in scalar field theories, so according to Eq. (5.3.20) the other scalar grav- 
itational perturbation ® here equals Y.) The perturbations to the pres- 
sure, energy density, and velocity three-vector are given by the terms in 
Eqs. (B.66)—(B.68) that are of first order in perturbations: 


dp = 966 + V'(G)59 — UG" , (10.1.9) 

bp = 95 — V'(P)59 — UG" , (10.1.10) 
as 5 

hi Where: én. (10.1.11) 
ox! Q 


The field equations may be taken as the Einstein equation Eq. (5.3.21) and 
the energy conservation condition (5.3.24), which here take the form 


w+ HY =41GOsg, (10.1.12) 


3V (g y? aV@) |. 
spt Hoot P 5g -hlo = -2w P sae (10.1.13 
ag a? 3p 
while the constraint (5.3.26) is here 
. V : 7 
H- Y = 4rG( — G39 + ğöp) . (10.1.14) 
a 


(In deriving Eq. (10.1.14), we make use of the convenient relation H = 
—4x Gg’, which follows from Eqs. (10.1.6) and (10.1.7).) The field equation 
(5.3.20) has already been accounted for by setting ® = Y, while the remain- 
ing field equations (5.3.19) and (5.3.22) and the momentum conservation 
equation (5.3.23) just repeat the information contained in Eqs. (10.1.12)— 
(10.1.14). 

Let us first consider the plane wave solutions of Eqs. (10.1.12)-(10.1.14), 
in which dg and Y are of the form exp(iq - x)dgq(t) and exp(iq - x) Vy (t), 
respectively. We will return later to the issue of how to put these solutions 
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together in forming the perturbations 5g(x, t) and W(x, t). The time depen- 
dence of the plane wave solutions is given by Eqs. (10.1.12) and (10.1.13), 
with —V? replaced with q? = q’: 


Wy + HW, =41G G gpI, (10.1.15) 
. VE) g 
ôq + 3H êq + ag Ma + a dq 
ƏV (p) a 
=-2U, T +40,¢, (10.1.16) 


while the constraint (10.1.14) now reads 


2 
(a+ 5) Wy = 40 G( — $ ôd + Ë ôpq) . (10.1.17) 
At sufficiently early times q/a will be much larger than H or 3? V (Ẹ)/3ğ?, 


so we can look for WKB solutions, for which time derivatives of fields yield 
factors of order q/a, of the form 


í d / 
5yq(t) > f(t) exp (-i J ; } 
t. 


„ Ut’) 


t dt! 

Yalt) > g(t) exp (-i [ a) ; (10.1.18) 
where f(t) and g(t) vary much more slowly than the argument of the 
exponential, and ¢, is arbitrary. The exponential factor is chosen so that the 
terms in Eq. (10.1.16) of second order in g/a should cancel. Eqs. (10.1.15) 
and (10.1.17) are both satisfied to leading order in q/a if we take 


g/f =4inGga/q . (10.1.19) 


The terms in Eq. (10.1.16) of first order in g/a then give 2f +2Hf = 0, 
sof « l/a. For reasons that will soon become clear, we will choose the 
common constant factor in both f and g so that f = 1/(27)7/*,/2qa. 
With this normalization, dg,(t) and W,(t) are defined as the solution of 
Eqs. (10.1.15)-(10.1.17) that satisfies the initial condition, that for a(t) > 0, 


1 t dt 
ê —i nc 10.1.20 
a (21r)3/2a(t),/2q E ( 4 Í. a) 


Ain GG(t) f 
Y(t) > owe (-i [ =) : (10.1.21) 
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The complex conjugate ôo) (t), Wa) is another independent solution, and 
since the system of equations (10.1.15)-(10.1.17) is second order, these are 
the only solutions. 

In general, the fields 5y(x,t) and W(x, t) satisfying Eqs. (10.1.12)- 
(10.1.14) can be written as superpositions of these two solutions, which 
the reality of the fields requires to take the form 


d9(x,t) = J dq [speta + spie] (10.1.22) 
Yx, t) = J dq [woei taa + oea] . (10.1.23) 


Now we must say something about the coefficients æ (q) and a*(q). For 
this purpose, we use the canonical commutation relations of the fields. The 
interaction of the scalar field with gravitation makes these commutation 
relations rather complicated, but they become simple at very early times. For 
any given q, we can find a time sufficiently early so that the expansion rate H 
is negligible compared with q/a. For such early times, both g and ¢ on the 
right-hand side of Eq. (10.1.17) become negligible, so Yg becomes negligible, 
as can also be seen by noting from Eqs. (10.1.18) and (10.1.19) that the ratio 
of W, to dg, vanishes like a(t) for a(t) + 0. At such early times, as far as a (q) 
and a@*(q) are concerned, we can find the canonical commutation relations 
by using Eq. (10.1.1) with g,,, taken as the unperturbed Robertson—Walker 
metric: 


1 (dg 2 1 d9 dg 
I= | d*xly, Lyo=at — — -V ; 
? | ja ee eol (2) mo A 


The canonical conjugate to the field g is then x = 0L,/d9 = a. Since the 
unperturbed fields are c-numbers, the commutators of perturbations are the 
same as the commutators of the fields themselves. This gives, for very early 
times, 


[pan 600.0] =0, [oan 800D] =1a-¥(N 3%K —y), 
(10.1.24) 
With 5g,(t) normalized to satisfy Eq. (10.1.20) for a(t) > 0, these com- 
mutation relations imply that a(q) and a*(q) behave as conventionally 
normalized annihilation and creation operators 


a(q),a(q’)}=0, Jaq), ae*@q)|=sq—q). (10.1.25) 
| j=% | | 


It may come as a surprise that the same annihilation and creation operators 
appear in the scalar field and gravitational perturbations, but it should be 
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keptin mind that Y does not represent gravitational radiation, whose quanta 
are created and annihilated by independent operators. Rather, WV is an 
auxiliary field, given by Eq. (10.1.14) as a functional of the inflaton field ôg, 
in much the same way that in the Coulomb gauge quantization of quantum 
electrodynamics, the time-component of the vector potential is a functional 
of the charged matter fields.* 

It may be noted in passing that the scalar field we have constructed 
is in accord with the Principle of Equivalence of general relativity. As 
long as we do not concern ourselves with co-moving wave numbers below 
some infrared limit Q (as for example by confining measurements to a 
cube of co-moving volume less than (27/Q)°), then at times for which 
Q/a > H the form of the scalar field 5g(x, t) ought to be essentially the 
same as a free massless real scalar field g(x, t) in ordinary Minkowskian 
space-time: 


3 
J one E (k)e"*'* exp (-i J k ar) + A*(k)e T2 exp (: J k dr)| i 


where A(k) and A*(k) are annihilation and creation operators, satisfying 
the familiar commutation relations 


[A(k), A(k)] =0,  [A(k), A*K')] = 8k — k’) . 


Note that the space coordinate appearing in the exponentials is ax, because 
in the K = 0 Robertson—Walker metric this is the vector that measures 
proper distances. It follows that k is related to the time-independent 
co-moving wave number vector q by k = q/a, so it is time-dependent, which 
is why we had to write the time-dependence factors as exp (+i fk dt) rather 
than exp(Fikt). Also, 8 (k — k’) = a*53(q — q’), so we can define operators 
a(q) that satisfy (10.1.25) as æ (q) = a7? A(k). Changing the variable of 
integration from k to q then gives the field for a(t) > 0 as 


d’q/a a?/2 iq:x 
TE l a (q)e'T* exp (-: f adia) 


4a3/2a* (qet exp (i f adira) 


in agreement with Eqs. (10.1.20) and (10.1.22). 
Finally, we have to choose the quantum state of the inflaton field during 
inflation. Though there are other possibilities, the simplest and most natural 


3See QTF, Vol. I, Eq. (8.2.9). 
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assumption is that the state of the universe during inflation is the vacuum 
\0), defined so that* 


a(q)\0)=0, (0/0) =1. (10.1.26) 


One other possibility is that it is a linear combination of œ and a* rather 
than @ that annihilates the state |0).> Another is that inflation takes place 
in the presence of a thermal distribution of inflatons.® 

But although not certain, the assumption (10.1.26) is at least plausible. 
The initial condition (10.1.20), which picks out a particular solution of 
the second-order system (10.1.15)-(10.1.17), is imposed at an early time, 
at which g/a >> H. At such times, we can treat the action as if it were 
nearly time-independent, so there exists a Hamiltonian operator H which 
to a good approximation generates the time-dependence of the fields: 


[H, ôg(xX, t)] ~ —i9(x, t) $ 


According to Eq. (10.1.18), for g/a >> H the time dependence of the coef- 
ficient function q(t) is given approximately by 


Qq = —1(q/a)q >’ 


so 

[H,œ(q)] > —(g/a) a (q) . 
Hence if a state |y) is an eigenstate of H with energy E, then a(q)|wW) is an 
eigenstate of H with a lower energy ~ E — q/a, unless |y} is the state |0) 
for which a(q)|0) = 0, which is therefore the state of lowest energy. Just 
as in ordinary laboratory physics, we expect any other state to decay into 
the state |0) of lowest energy, although there remains a question whether 
the Coy occurs rapidly enough to be effective in the period before horizon 
exit. 

In the state satisfying Eq. (10.1.26), quantum averages (0| (x1, Hdg 
(x2, t) --- |0) (as well as those also involving W(x, f)) may be calculated by 
moving all annihilation operators «œ (q) to the right and all creation operators 
a*(q) to the left, picking up commutators when a a(q) is moved to the right 
past a a*(q’) or a a*(q) is moved to the left past a a(q’). The result is then 
given by Wick’s theorem: The quantum averages of products of 5ys and/or 


4This state is often called the Bunch-Davies vacuum: see T. S. Bunch and P. C. W. Davies, Proc. Roy. 
Soc. Ser. A 360, 117 (1978). 

5E. Mottola, Phys. Rev. D 31, 754 (1985); B. Allen, Phys. Rev. D 32, 3136 (1985). 

6K. Bhattacharya, S. Mohanty, and R. Rangarajan, Phys. Rev. Lett. 96, 121302 (2006). 

7This is studied by C. Armendariz-Picon, J. Cosm. & Astropart. Phys. 0702, 031 (2007) [astro- 
ph/0612288]. 

8See QTE, Vol. I, Sec. 9.1. 
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Ws are Gaussian in the sense of Appendix E (except that we must keep track 
of the order of operators), with the pairings given by vacuum expectation 
values of products of the paired fields: 


(OlSy(x, NSe(y, 1/0) = J d°q leg iD, (10.1.27) 
(O|W(x, HV (y, £10) = [ea [Yq OD , (10.1.28) 
(O[Sp(x, HY (y, 010) = J Aq dpq(th¥z(t) el" , (10.1.29) 
(O| W(x, Hde(y, 1)|0) = J dq WAO SpA AT. (10.1.30) 


For instance, 


Olp (w, DP (x, OY (y, Hdg(z, t10) 
= (08y (w, W(x, £)10) (0% (y, Hdg(z, t10) 
+ (Olp (w, HY (y, £10) (OY (x, HdpCz, t)10) 
+ (Olôp(w, t)ôp (z, t)10) (O| (x, £) Y (y, £)10) 


These are quantum averages, not averages over an ensemble of classical 
field configurations. We see this most clearly in Eqs. (10.1.29) and (10.1.30), 
which give complex results for the averages of products of real fields, and 
consequently also depend on the order of the fields.’ Just as in the measure- 
ment of a spin in the laboratory, some sort of decoherence must set in; the 
field configurations must become locked into one of an ensemble of classical 
configurations, with ensemble averages given by the quantum expectation 
values calculated as in Eqs. (10.1.27)-(10.1.30) It is not apparent just how 
this happens, but it is clear that decoherence cannot occur until expectation 
values of products of real fields become real, which for free fields will also 
imply that the expectation values do not depend on the order of the fields. 
As we shall see, this happens after perturbations leave the horizon, when the 
various functions g(t) and W(t) become dominated by a single solution 
of the field equations, which (since the field equations are real) is necessarily 
real up to a possible complex factor. Once the universe becomes classical in 
this sense, we can invoke the Ergodic Theorem of Appendix D to interpret 
averages over ensembles of possible classical universes as averages over the 
position of the observer in our universe. 

For any given potential V(@), it is always possible to find g(t) and 
W(t) by numerically solving Eqs. (10.1.15)-(10.1.17), subject to the initial 


°For a discussion of this point, see D. H. Lyth and D. Seery, astro-ph/0607647. 
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conditions (10.1.20) and (10.1.21). But this is complicated, and gives more 
information than we can use. Between the time of inflation and the present 
there intervenes a so-called reheating period when the energy of the inflaton 
field ọ is converted into ordinary matter and radiation. We know essentially 
nothing about this process, so the solutions for 6g and Y during inflation 
do not have an immediate interpretation in terms of observations of the 
present universe. Fortunately, the reheating era (and other ill-understood 
eras) occur when all cosmological fluctuations of observational interest are 
outside the horizon. The one use that we can make of the solutions for the 
fields during inflation is to calculate some quantity that becomes conserved 
outside the horizon, and that thus provides an initial condition for the 
evolution of perturbations after they re-enter the horizon. 

Here we will concentrate on the quantity R discussed in Section 5.4, 
defined in Newtonian gauge by 


R=—-W+ Hou. (10.1.31) 


With a single scalar field this is conserved outside the horizon during inflation, 
because as shown in Section 5.4 there are always two solutions for which œR is 
constant for g/a <« H, and as already noted the equations for 
inflation with a single scalar field only have two independent solutions. 
Using Eqs. (10.1.11) and (10.1.22)-(10.1.23) we find that during inflation 


Rix, t= J dq [Ro itaq) + RE (1) ea (q) | ,  (10.1.32) 
where 

Rq = —Vq — Hb¢q/¢ . (10.1.33) 

Of course, the quantum averages of products of R(x, t)s are Gaussian in 


the same sense as those of W,(t) and dg,(t), with pairings given by the 
expectation value 


OIRE, OR(y, £)|0) = J dg FY RA (10.1.34) 


Instead of calculating Y,(t) and dg,(t) and then using the results to 
calculate R4(t), it is much more convenient to solve a differential equation 
for Rq(t) itself. This equation can be derived in Newtonian gauge with 
some trouble from Eqs. (10.1.15)-(10.1.17), but it is more easily derived in 
a different gauge, defined by the conditions 


ôg =0, Bqı=0 
Inspection of Eqs. (B.66)-(B.68) shows that in this gauge 


1 2 1 fe 
6p = 6p = x hoe” = =i , ôu=0. 
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Also, as mentioned in Appendix B, the energy-momentum tensor for a single 
real scalar field has the perfect fluid form, so the anisotropic inertia mj 
vanishes. The gravitational field equations (5.1.44) and (5.1.46) and the 
energy conservation equation (5.1.49) then give 


0=HE+23H?+H)E+a°V7-A—-A-6HA+2a 'HVF, 


0=-HE +4 
E l T E E 
0=->— (EH) -3HHE-a'HWF +>HA, 
2 dt 2 
in which we have again used the relation H = —47G|g|?. Eliminating E 


and F yields a differential equation for A: 


p 2H ÄĤ\. lo, 
A+ {13H —- — + — |4- VA =Q. 
H H a? 


The gauge invariant formula (5.4.22) tells us that in this gauge R = A/2, so 
the same equation applies to R. Going over to its Fourier transform, this 
gives what is sometimes known as the Mukhanov—Sasaki equation:!? 


dR 2 dz dRg 


dt? zdt dt +4 Ra =0, (10.1.35) 
where t is the conformal time 
t dt 
ref =. (10.1.36) 
ty 


with ¢, an arbitrary time, to be chosen later, and 


Z 


Ii 
TJs.. 


(10.1.37) 


The initial condition is provided by returning to Newtonian gauge, and using 
Eqs. (10.1.20) and (10.1.21) in Eq. (10.1.33). Assuming that a() (1) /H (t) 
vanishes in the limit a(t) —> 0, only the term in Eq. (10.1.33) proportional 
to ôgq4 contributes in this limit, and we find that for a(t) > 0: 


E H(t) 
(27)3/2,/2q a(t) PA) 


Rq t) > exp (—iqT) (10.1.38) 


10V, S, Mukhanov, JETP Lett. 41, 493 (1986); S. Sasaki, Prog. Theor. Phys. 76, 1036 (1986); V. S. 
Mukhanov, H. A. Feldman, and R.H. Brandenberger, Phys. Rep. 215, 203 (1992); E. D. Stewart and 
D. H. Lyth, Phys. Lett. B 302, 171 (1993). 
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For a given potential we must integrate Eq. (10.1.35) out from a = 0 to 
beyond the horizon, where g/a « H. In this limit we can drop the QP Rg 
term in Eq. (10.1.35), which then has two solutions, a dominant solution 
with Ry a non-zero constant, and a solution for which Rq approaches zero 
with dR,/dt decaying as 1/z?. It is the constant limit Rå of Rq(t) outside 
the horizon that we need. (As we saw in Section 5.3, the value of the quantity 
¢q that is sometimes used in analyses of cosmological fluctuations is given 
far outside the horizon by tg = Ry) 

It is striking that within the scope of the general assumptions made here, 
it is not necessary to make any arbitrary assumptions about the strength 
of cosmological fluctuations. For any given potential, we need only solve 
equation (10.1.35) with the initial condition (10.1.38), and carry the solution 
forward in time to when Rq(t) reaches its constant value RG outside the 
horizon. 

On what features of the potential does Rg depend? The initial behavior 
(10.1.38) of Rq(t) deep inside the horizon is independent of the nature of 
the potential, while outside the horizon R(t) simply becomes constant. 
Thus Rå can depend only on the behavior of the potential V(g) for values 
of ọ near the value taken by 9(t) at the time the perturbation leaves the 
horizon. 

This has an important implication for the part of the era of inflation that 
can be revealed through observations of scalar fluctuations. As we have just 
seen, in observing a fluctuation with co-moving wave number q, we learn 
about the time fy of horizon exit during inflation, when g/a(tg) = H (tq). 
To put this another way, the number of e-foldings N (q) between the time tg 
that we learn about in observing a perturbation of wave number q and the 
beginning of the radiation-dominated era at a time t is!! 


2 at) \ a(to) H (to) (tq) 
N(q) =|n (a) =No+In (ae ) +In (Fe) , (10.1.39) 


where 


a(to)H (to) 


and as usual fg is the present time. In particular, we cannot observe any 
fluctuation unless q is large enough to have entered the horizon by the 
present time, which requires that the present physical wavelength a(to)/q be 
less than the present horizon distance  1/H (to), so the maximum number 
of e-foldings before the beginning of the radiation-dominated era that can 


Nosta (N (10.1.40) 


11A, R. Liddle and S. M. Leach, Phys. Rev. D 68, 103503 (2003) [astro-ph/0305263]. 
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ever be observed is 


7 H(t) 
Nmax = No + In (Fe)... , (10.1.41) 


in which the second term on the right is the maximum value of In[H(t)/ 
H (tı)] for t within the last Nmax e-foldings of inflation. In general we expect 
energy to be lost during inflation and in the reheating phase at the end of 
inflation, so H (texit) > H(t1) and hence Nmax > No, but for slow roll 
inflation it may not be a bad approximation to neglect this energy loss, in 
which case the second term in Eq.(10.1.41) can be neglected, and 


Nmax > No . (10.1.42) 


The reason that we have chosen to write the formula for N (q) as in 
Eq. (10.1.39) is that we have already calculated No in Section 4.1; it is 


1/4 
Py 
ery ee oe 10.1.43 
No (asa) ( ) 


where p; is the energy density at the beginning of the radiation-dominated 
era. For instance, if we take h = 0.7 and p} = G~? = [1.2 x 10!9 GeV}, 
then No ~ 68, and according to Eq.(10.1.42) we can only explore the final 
68 e-foldings of inflation. We will re-evaluate this bound in Section 10.3, 
with a better estimate of p1. 

There is one form of the potential for which the constant Rj can be cal- 
culated analytically, with no further approximations. !? It is the exponential 
potential 

V (o) = g e™? (10.1.44) 


(with g and A arbitrary real constants) which we have already considered 
in Section 4.2. Of course, the potential cannot have this form for all 
ọ, or inflation would never end, but as remarked in the previous para- 
graph, in the calculation of Rọ it is only relevant that the potential should 
take this form for values of the field near the value it takes at horizon 
crossing. 

The solution of Eqs. (10.1.6) and (10.1.7) for the exponential potential is 


(10.1.45) 


12L, F. Abbott and M. B. Wise, Nucl. Phys. B 244, 541 (1984); F. Lucchin and S. Matarrese, Phys. 
Rev, D 32, 1316 (1985); Phys. Lett. B 164, 282 (1985); D. H. Lyth and E. D. Stewart, Phys. Lett. B 
274, 168 (1992). 
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and 
H = l/et , (10.1.46) 
where € is the positive dimensionless quantity 
H Me 
=— = ; 10.1.47 
<=? ~ 6nG Á ) 


Inflation with this potential is often called power-law inflation, because 
a œx t'/€, It is convenient for the moment to normalize the co-moving 
coordinates so that 
a=“, (10.1.48) 

Note that ag?/H œ t7-©®/€, so our previous assumption that ag?/H 
vanishes for a(t) — 0 is satisfied if € < 1, as we will assume. 

For this potential it is convenient to take the constant t, in the definition 
(10.1.36) of conformal time as tą = oo, in which case the conformal time is 
negative 


l—e 


Tes (+) po U-O/e (10.1.49) 


As ¢t and a(t) run from zero to infinity, t runs from —oo to zero. In terms 
of t, Eq. (10.1.35) now reads 


aRg 2a Re 
dt? (l—e)t dt 


+R, =0, (10.1.50) 


whose solutions are proportional to t” times a Hankel function HS (—qt) 
or HË (—qr), where 


1 2 3 
pois oe eee (10.1.51) 
2 l—e 


For large real x the first Hankel function has the asymptotic behavior 


|2 
H (x) = ey (ix — ivz/2 — ir /4) > 


and HP® (x) = H* (x), so the initial condition (10.1.38) picks out the 
solution « ag he (—qt), and fixes its normalization so that 


> B AVT € =1/(1—e) 
gt) = KOAI 
x exp (> + =) (CTP HP (—qr). (10.1.52) 
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For small real x 


HG) > = © OY 


2 


so outside the horizon, in the limit q/a «< H where —gt < 1, the quantity 
Rg approaches a constant Rẹ, given by Eq. (10.1.52) as 


Arw) e \7M09 inv in /q\- 
Oo _ tas 
r= i) exp (+5 (5)  (10.1.53) 


For purposes of comparison with results for other potentials, it is conve- 
nient to rewrite this in terms of quantities at the time t4 of horizon crossing, 
defined by 


q/a(tq) = H (tg) . (10.1.54) 


Solving this gives tg = (€q)/“"-©® and H(tg) = q7“ 079 e7!/0-9. It is 
also convenient to use Eq. (10.1.47) to express A in terms of € and G. Then 
Eq. (10.1.53) can be written 


> T (v)2»73/2 7 inv in 
= = 3/2 — -)t/d-e) 
bo = Ry = iv lon Gq” HH (tq) ‘fe (1 —e) © exp 5 + a): 


(10.1.55) 


Note that this formula does not depend on the convention we have chosen 
in Eq. (10.1.48) for the constant factor in a(t). 

To have a sufficient number of e-foldings in inflation, it is necessary for 
the potential to be fairly flat, so we are chiefly interested in the case where 
€ is small. Fore < 1, we use (3/2) = yx /2, and write Eq. (10.1.55) as 


1 


——... 10.1.56 
an? fe ( ) 


62 = RE = —iV 16r Gq? H (tq) 
The most important point is that for a nearly flat potential, for which H (tq) 
is nearly g-independent, R4 is nearly proportional to q7?/2. This result is 
not limited to potentials of exponential form, as long as the potential is 
fairly flat for fields near Q(t,). 

The approximate g~*/* dependence of Rå on q is well supported by 
observation. We saw in Section 8.1 that the power spectral function P(k) 
that governs the distribution of dark matter has a dependence on the physical 
wave number k that for small k is given by a factor KR Lay |I, so any potential 
that is fairly flat for fields near ø (t4) gives a spectrum close to the Harrison- 
Zel'dovich spectrum, P(k) œ k for small k. Also, as we saw in Section 
6.3, the spectral function Pg (q) for the Newtonian gravitational potential is 
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proportional for long wavelengths to IROP, which as shown in Section 2.6 
implies that (aside from the integrated Sachs—Wolfe effect and the effects of 
relonization) the quantity (£ + 1)C? becomes independent of £ for small 
£. And we saw in Sections 7.2 and 7.4 that the near proportionality of Rg 
to q~>/? gives results for the correlations of temperature and polarization 
fluctuations in the cosmic microwave background at larger values of £ in 
good agreement with observation. 

To be more precise, Eq. (10.1.53) shows that for the exponential potential, 
the scalar spectral index defined in Section 7.2 by IR œ q74™”s has the 
constant value 


2e 
— E€ 5 


ns=4-2=1— 


The third-year WMAP result ns = 0.958 + 0.016 quoted in Section 7.2 
thus shows that for the exponential potential, € = 0.021 + 0.008. This is 
small enough so that we can use the approximate formula (10.1.56). Writing 
IR2I? ~ |N |747, we see that for the exponential potential 


where Hexit is the expansion rate at horizon exit, now ignoring its weak 
dependence on q. The third-year WMAP result |N|? = (1.93 + 0.12) x 
107! quoted in Section 7.2 thus shows that for the exponential potential, 
Hexit X 20|N|/e/G x 1.5 x 10!4 GeV, corresponding to an energy density 
3H? „/87G ~ [2.6 x 10!° Gev]}. 

Of course, the exponential potential is just one special case, with the 
special property that V’”/V = (V'/V)?, so there is no reason to expect these 
results to apply in detail for other potentials. In Section 10.3 we will see 
what to expect for general potentials, under the slow-roll approximation. 
But within the slow-roll approximation, the results for general potentials 
are similar in order of magnitude to those we have already found for the 
exponential potential. 

We are now in a position to consider the plausibility of the assumption 
that the action has the simple form (10.1.1).!> With a single real scalar field 
x, there is no loss of generality in taking the action in the form (10.1.1) if 


3There is a large literature on other possibilities. Here is a partial list: C. Armendariz-Picon, 
T. Damour, and V. Mukhanov, Phys. Lett. B 458, 209 (1999) [hep-th/9904075]; J. Martin and R. 
H. Brandenberger, Phys. Rev. D 63, 1235012 (2001) [hep-th/0005209]; R. H. Brandenberger and J. 
Martin, Mod. Phys. Lett. A 16, 999 (2001) [astro-ph/0005432]; J. C. Niemeyer, Phys. Rev. D 63, 
123502 (2001) [astro-ph/0005533]; J. C. Niemeyer and R. Parentani, Phys. Rev. D 64, 101301 (2001) 
[astro-ph/0101451]; A. Kempf and J. C. Niemeyer, Phys. Rev. D 64, 103501 (2001) [astro-ph/0103225]); 
R. Easther, B. R. Greene, W. H. Kinney, and G. Shiu, Phys. Rev. D 64, 103502 (2001) [hep-th/0104102]; 
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we assume that the action contains at most two spacetime derivatives. The 
most general action in this case is 


1 
= fex v —Detg |-5Kooet i = uoo] , 
where K and U are real functions of x that are arbitrary, except that unitarity 
requires K > 0. This can be put in the form (10.1.1) by introducing g = 
fax KQ). 

But why should the terms in the action contain no more than two space- 
time derivatives? This is only one of a number of similar questions: Why 
should the action of gravitation contain only the Einstein term, propor- 
tional to f d4x,/—DetgR, and not other generally covariant terms with 
more than two derivatives of the metric? And why should the action of 
the standard electroweak model or quantum chromodynamics contain only 
renormalizable terms? 

We do not know the answers to any of these questions with any cer- 
tainty, but we have at least a plausible possible answer. Any of these addi- 
tional terms in the action would involve operators of higher dimensionality, 
but dimensional analysis requires that such terms must be accompanied 
by coefficients containing additional negative powers of some fundamental 
mass. If that mass is large enough enough, then the additional terms in the 
action are suppressed under ordinary conditions. For the success of general 
relativity, it is only necessary that the length 1/M be sub-macroscopic, but 
unless we impose a condition of baryon and lepton conservation, the funda- 
mental mass appearing in the action of quarks and leptons has to be at least 
of order 10!° GeV, in order to suppress proton decay below experimental 
bounds. 

The inflaton field is considered to take values of the order of the Planck 
mass, so there is nothing to suppress arbitrary powers of the scalar field in 
the inflaton action, which is why we take the potential V (o) as an arbitrary 
function. But spacetime derivatives of the inflaton field yield factors of 
order g/a, so each additional spacetime derivative would introduce a factor 
of order g/aM. At horizon exit, g/a equals H, which we have seen is of 
order 10!4 GeV, while we would expect M to be much larger, somewhere 


Phys. Rev. D 66, 023518 (2002); Phys. Rev. D 67, 063508 (2003) [hep-th/0110226]; R. H. Brandenberger, 
S. E. Joras, and J. Martin, Phys. Rev. D 66, 083514 (2002) [hep-th/0112122]; N. Kaloper, M. Kleban, 
A. Lawrence, and S. Shenker, Phys. Rev. D 66, 123510 (2002) [hep-th/0201158]; N. Kaloper, M. Kleban, 
A. Lawrence, S. Shenker, and L. Susskind, J. High Energy Phys. 0211, 037 (2002) [hep-th/020923 1]; 
U. H. Danielsson, Phys. Rev. D 66, 023511 (2002); L. Bergström and U. H. Danielsson, J. High Energy 
Phys. 12, 38 (2002); C. P. Burgess, J. M. Cline, F. Lemieux, and R. Holman, J. High Energy Phys. 2, 48 
(2003); J. Martin and R. Brandenberger, Phys. Rev. D 68, 063513 (2003); J. Martin and C. Ringeval, 
Phys. Rev. D 69, 083515 (2004); T. Okamoto and E. A. Lim, Phys. Rev. D 69, 083519 (2004). R. de 
Putter and E. V. Linder, 0705.0400. 
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in the range from |V|!/4 ~ 10!6 GeV to 1//G ~ 10! GeV. Hence it is 
plausible that at and after horizon exit, and for some time before horizon 
exit, terms with more than the minimum number of spacetime derivatives 
are suppressed. This is all we need to justify the calculations of this section, 
at least as a good first approximation, as long as we stick to a single inflaton 
field. The possibility of more than one inflaton field will be discussed in 
Section 10.4. 


10.2 Tensor fluctuations during inflation 


The fluctuations in the tensor field D(x, t) during inflation can be treated 
in much the same way as the scalar fluctuations considered in the previous 
section. Since the tensor anisotropic inertia ni vanishes for scalar field 
theories, the field equation (5.1.53) for the tensor modes takes the simple 
form 

V?Dj — a’ Dy — 3aåDj = 0. (10.2.1) 
We recall also that Dj satisfies conditions that eliminate any vector or scalar 
contributions: 


Dj = Dï, Di=0, @Dj=0. (10.2.2) 


The plane wave solutions have the form eij Datex, where D4 (t) satisfies 
the differential equation 


Dy + 3HD, + (¢7/a)D, = 0 (10.2.3) 
and e;j is a time-independent polarization tensor satisfying the conditions 
eij = eji » ei = 0 ; qieij = 0. (10.2.4) 


We recall from Section 5.2 that for a given unit vector ĝ, there are two 
independent polarization tensors satisfying these conditions. For ĝ in the 
three-direction, these can be chosen to have components 


en = -en =1/V¥2, eņn=en =Hi/V2, ez =e =0. (10.2.5) 


For g in any other direction we define ej to be the tensor obtained by 
applying to (10.2.5) the standard rotation (7.4.12) that takes the three-axis 
into the direction of g. The polarization tensors constructed in this way are 
called e; (ĝ, +2), because Eq. (10.2.5) describes a wave of helicity +2. 

At early times, when g/a > H, Eq. (10.2.3) has WKB solutions of the 
form 


D,(t) —> h(t) ex (-i ig a) (10.2.6) 
q p q n ay)’ i2. 
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where A(t) varies much more slowly than the argument of the exponential, 
and t, is arbitrary. Then the terms in Eq. (10.2.3) of second order in q/a 
cancel, while the terms of first order give 2h + 2Hh = 0, so that h œ aa! 
From now on we will define D4 (t) as the solution of Eq. (10.2.3) which is 


normalized so that, for a(t) > 0, 


V 167 G t dt 
D —i : 10.2.7 
ace (21r)3/2./2q a(t) E ( g J. 5) i ) 


Since D,(t) and A) are a complete set of solutions of Eq. (10.2.3), the 
most general real tensor field satisfying the conditions (10.2.1) and (10.2.2) 
takes the form 


Dy(x.) = YO J d°q [D 0e" bq, ei], 2) 


AE? 
+D* (Ne B*(q, Nez, 0] (10.2.8) 


(That is, instead of characterizing two independent solutions Dıq(t) and 
D(t) of Eq. (10.2.3) by their behavior at late times, as we did in Section 
5.2, we now take them as D,(t) and DUO), characterized by their behavior 
at early times.) With D,(t) normalized to satisfy Eq. (10.2.7), the canonical 
commutation relations require that 


[B(q.4), Ba’. AI =0, BA), B*(q’,A)] = 83(q aw . (10.2.9) 


Thus £ (q, A) and §* (q, à) can be interpreted as the annihilation and creation 
operators for a graviton of helicity A. 

As in the case of scalar perturbations, we assume that during infla- 
tion the universe is in a quantum state |0) satisfying the vacuum condition 
(q, A)|0) = 0. Then expectation values of products of Ds are Gaussian, 
with pairings 


(01D; (x, t) Dx (y, t)|0) = J Pq eD DO Tya (10.2.10) 
where Myx (ĝ) is the helicity sum given by Eq. (5.2.25): 
Mykel) = X eG.) eke, A) 
À 


= dix dje + diedjx — Ôijôke + Si Gk Ge + Skeĝiĝj — Sikĝje — SiG Ik 
— Sie Gige — SjeGidk + UUGIUKQ- (10.2.11) 


For any given potential, the function D,(t) is to be calculated by inte- 
grating the differential equation (10.2.3) with the initial condition (10.2.7). 
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Outside the horizon, for g/a « H, the solution becomes a constant 
Dj, which provides an initial condition for the gravitational wave when 
it re-enters the horizon. In carrying this out, it is again useful to replace 
ordinary time as the independent variable with conformal time 


CO / 
2 -f a. (10.2.12) 
© a(t’) 
and take ¢, = oo in Eq. (10.2.7). Eq. (10.2.3) then becomes 
dD dD 
2Ha—! +g D; =0. 10.2.1 
q+ ee TERETA (10.2.13) 
Note that F ig 
a a 
Ha=— =-—. 
% dt adr 


Thus Eq. (10.2.13) is the same as the Mukhanov-—Sasaki equation (10.1.35) 
except that z = a@/H is replaced with a. 

The initial condition (10.2.7) for g/aH >> 1 is independent of the details 
of the potential, while for g/aH < 1 the tensor amplitude D,(t) simply 
approaches a constant Dj. Hence, just as for the scalar amplitude RI, the 
tensor amplitude D% outside the horizon can only depend on the behavior 
of the potential at values of g(f) at around the time of horizon exit, when 
q/aH is of order unity. Thus the measurements of the tensor amplitude after 
horizon re-entry can only tell us about the last V e-foldings of inflation, with 
N bounded by Eq. (10.1.40), just as for scalar modes. 

As for scalar modes, it is useful to consider as a test case the one 
potential for which D7 can be calculated analytically without relying on 
the slow-roll approximation, the exponential potential (10.1.44). For this 
potential @/H is the constant 2€/A, so Eq. (10.1.37) gives z œ a, and in 
this case Eq. (10.2.13) is precisely the same as the Mukhanov-—Sasaki equa- 
tion. Also, the initial condition (10.2.7) is the same as the initial condition 
(10.1.38), except for a factor —2ev 16r G/à = —2,/e. Hence for the exp- 
onential potential: 


Dy(t)/Rg(t) = —2eV 160 G/d = -2/6 . (10.2.14) 


Of course, the values Di and RI outside the horizon then also have this ratio. 
This is usually expressed in terms of a scalar tensor ratio, conventionally 
defined as 
rq = 4D /R2I? . (10.2.15) 
We see that for the exponential potential, ry has the wavelength-independent 
value 
r= l6e. (10.2.16) 
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We saw in the previous section that for the exponential potential the third- 
year WMAP results give € = 0.021 + 0.008, so that r = 0.34 + 0.13. This is 
almost incompatible with the upper bound r < 0.3 on r set (for this value of 
€) by the third-year WMAP results,! so a potential V (p) that is exponential 
around the value that g takes at the time of horizon exit is almost ruled out. 
(Of course, we already knew that the potential could not be exponential over 
the whole range of g, for then inflation would never end, but we are here 
not relying on any assumption about the form of the potential except for 
the values of g taken around the time of horizon exit.) To analyze tensor as 
well as scalar perturbations for more general potentials, we need to invoke 
the slow-roll approximation, to which we turn in the next section. 


10.3 Fluctuations during inflation: The 
slow-roll approximation 


It is not possible to calculate the scalar and tensor perturbations R,(t) 
and D,(t) analytically for general potentials. However, the need (discussed 
in Chapter 4) for a substantial number of e-foldings of expansion during 
inflation suggests that H should have been slowly varying during an era 
long compared with 1/H. We will therefore now assume that H varies little 
throughout a “slow-roll” era, during which g/aH goes from much less to 
much greater than unity. We saw in Sections 10.1 and 10.2 that the asymp- 
totic values Rg and Dj depend only on the evolution of the fields around 
the time of horizon crossing, when g/a ~ H, so it will not be necessary 
for us to assume that the slow-roll era extends back to the beginning of the 
expansion, or forward to the end of inflation. 
We will work with the Mukhanov-—Sasaki equation (10.1.35):! 


BR 2 dz dRg 


2 
Rg =0 10.3.1 
Ge ade ae eee ee 
where zt is the conformal time, and 
ap 
Ss, 10.3.2 
z= (10.3.2) 


1 See Figure 14 of D. Spergel et al., Astrophys. J. Suppl. Ser. 170, 377 (2007) [astro-ph/0603449]. 
' This is usually given in the equivalent form 


where ug = ZRqą. It is easier to work with it in the form (10.3.1), if only because in this way we only 
need to calculate the first derivative of z. 
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Recalling once again that H = —42 G@”, we can write 
ld 
See!) (10.3.3) 
z dt 


where for a general potential 
€=—-H/H’?, 8=H/2HH. (10.3.4) 


We also need a formula for aH in terms of t. For this purpose, we note that 


d (1 
Sf e 10.3. 
=(=) +e (10.3.5) 


So far, everything is exact. The slow-roll approximation requires that 
€ and 6 are small during the era of horizon crossing, which has the conse- 
quence that e varies little during this era, because 


aor (+5) H. (10.3.6) 


We will also assume that ô varies little? (Of course, € and 6 cannot be 
strictly constant except for an exponential potential, for which H « 1/t 
and therefore 6 = —e.) Integrating Eq. (10.3.5) then gives, for a suitable 
choice of an additive constant in T, 


1 


AE eee 


(10.3.7) 


As in the case of the exponential potential, t is negative, and its magnitude 
goes from —t > 1/q early in inflation when q/aH > 1, to —t « 1/q for 
q/aH <« 1. Using Eqs. (10.3.3) and (10.3.7) in Eq. (10.3.1) then gives, to 
first order in € and ô, 


Ry 21+6+42€) dRg 
dt? T dt 


The general solution of this equation for constant 6 and e€ is a linear com- 
bination of 1HP (—qr) and tH (—qr), where now 


+R =0, (10.3.8) 


3 
v=3+2e+8. (10.3.9) 


2This is the case for a power-law potential, under the same condition |g| >> 1/47 G that was found 
necessary in Section 4.2 to justify the slow-roll approximation. For the consequences of dropping this 
assumption, see S. Dodelson and E. D. Stewart, Phys. Rev. D 65, 101301 (2002) [astro-ph/0109354)]; 
E. D. Stewart, Phys. Rev. D 65, 103508 (2002) [astro-ph/0110322]. 
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This agrees to first order in e with the result (10.1.51) for the exponential 
potential, for which 6 = —e. 

Now, we are not assuming that the slow-roll approximation applies all 
the way back to the beginning of the expansion, but fortunately this is not 
necessary. Eq. (10.1.38) should still apply by the beginning of the slow-roll 
era, when q/aH is still very large, so this initial condition fixes the solution 
during the whole of the slow-roll era as 


STT , 
Rg(t) = —— e in 7D (gr) . 10.3.10 
gb) QOny3?2 a(t) v ( qT) ( ) 
(Note that Eqs. (10.3.3) and (10.3.7) give z(t) x t™”+!/2 during the slow- 


roll era.) Then late in the slow-roll era, when g/aH < 1, Eq. (10.3.10) has 
the asymptotic value 


R? a V/ =t (v) eity/2+in/4 Tat a (10 3 11) 
am OnO 2 i i 


which is constant because Eqs. (10.3.3) and (10.3.7) give z « t™”+!/2, Thus 
R has the q-dependence? 


R aa tag ere (10.3.12) 


This may be regarded as a generalization of the result Rj « gq >/2-€ that 
we found for the exponential potential, to the case where ô 4 —e. 

Because Rå is time-independent, it can be calculated by setting ¢ in 
Eq. (10.3.11) to any convenient value. We shall evaluate it at the time tg 
of horizon crossing, defined as in Sections 10.1 and 10.2 by 


q/a(tq) = H (t4) . (10.3.13) 


(But note that Eq. (10.3.10) with t = t, does not give the correct value of 
Rj.) Ignoring corrections of order € or ô except in exponents, Eq. (10.3.7) 
gives 
(tg) (10.3.14) 
T(tg) = -———— = --. Ok 
í d-q4 q 


Also, H= —4r Go’, so 


P(tg) = yY —H (tg) /V4nG = H (ta) J € (tg) / VANG . 


3E. D. Stewart and D. H. Lyth, Phys. Lett. B 302, 171 (1993). This calculation was carried to 
the next order of the slow-roll approximation by A. R. Liddle and M. S. Turner, Phys. Rev. D 50, 
758 (1994). For a review, see J. E. Lidsey, A. R. Liddle, E. W. Kolb, E. J. Copeland, T. Barreiro, and 
M. Abney, Rev. Mod. Phys. 69, 373 (1997). 
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and so 


+,/€(tg)q 
——. 10.3.15 
H (tg) 4n G ( ) 


The slow-roll approximation thus gives* 


pa A162 Gq- 77H (tq) 

=> l . 
pe 8273/2 Selta) 
This is the same as the result (10.1.56) for an exponential potential, except 
that the factor 1/,/e(¢,) now contributes to the q-dependence of Rj. (To 
check that the q dependence of H (tq) and e (tq) gives the extra factor ee 
we differentiate Eq. (10.3.13) with respect to q, and find 


dtg 1 


dd a(ty) (B) + H()) l 


Z(tq) S 


(10.3.16) 


Then ; 
q dH(ta) | H (tq) = € (tq) 


Ha) d H(t,)+H (tg) 1e) 
and Eq. (10.3.6) gives 


q delta) — 2(€(tg) + 5(tq)) 


elta) dq 1 — e (t4) 


Hence, replacing the denominators 1 — e with unity, H (tọ) « q © and 
E(tg) x q+, and so H(tg)/./€(tg) x q?/?-”, which gives Eq. (10.3.16) 
the g-dependence (10.3.12).) 

Now let us apply the slow-roll approximation to the tensor modes. In 
general, the tensor wave equation (10.2.3) can be written as 


dD, dD 
— 4 4 24H! + q’D, =0. 10.3.17 
Ze ge ee 0217) 
During the slow-roll era, we can use Eq. (10.3.7) to put this in the form 
d’Dy 2 dD, 
dr? (d-er dt 


+g D =0. (10.3.18) 


45, W. Hawking, Phys. Lett. B 115, 295 (1982); A. A. Starobinsky, Phys. Lett. B 117, 175 (1982); 
A. Guth and S.-Y. Pi, Phys. Rev. Lett. 49, 1110 (1982); J. M. Bardeen, P. J. Steinhardt and M. S. Turner, 
Phys. Rev. D 28, 679 1983); D. H. Lyth, Phys. Lett. B 147, 403 (1984); B 150, 465 (1985); Phys. Rev. D 
31, 1792 (1985). 
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The general solution for constant € is a linear combination of t“ H au (—qt) 
and tH) (—qr), with 
3 € 
C=) T l—e` 
The solution during the slow-roll era that satisfies the initial condition 
(10.2.7) is 


(10.3.19) 


v l6rG 
T exp(iux/2 +in/4) HP(— qT). 
(27)3/2,/2ga(t) 
(10.3.20) 


(Note that Eq. (10.3.7) gives (t/a)da/dt = aHt = —1/(1—€), soa « 
t~!/C-©) and therefore /—t/a œ t”.) The asymptotic solution for g/a < 
H is then 


D(t) = 


_,vl61G T'(p)./f— qc\ & 
D? = 2 4 . (10.3.21 
7 S mO a T expliur/ + in/ (+ 5 ) (10.3.21) 
Thus D% has the g-dependence?* 
Dg ig, (10.3.22) 


In contrast with the case of the exponential potential, the asymptotic 
q-dependence of the tensor modes is in general different from that of the 
scalar modes. 

We can give a more convenient expression for Dj by setting t = tq in 
Eq. (10.3.21). Then using Eq. (10.3.14) and a(ty) = q/H (tg) and taking 
e — 0 everywhere but in the g-dependence of H(t,), Eq. (10.3.21) gives> 


po — ;VITGH (ty) 


It is conventional to write the q-dependence of the squared magnitudes 
of the tensor and scalar amplitudes outside the horizon as 


Der gene... AR g are: (10.3.24) 
Then in the slow-roll approximation, Eqs. (10.3.12) and (10.3.22) give 
nT (q) = —2e (t4) , ns(q) = 1 — 4€(tg) — 28 (tq) . (10.3.25) 


(As a check, recall that for the exponential potential 6 = —e, so Eq. (10.3.25) 
gives ns = | — 2e, in agreement with the result of Section 10.1.) Also, 


5A. A. Starobinsky, JETP Lett. 30, 683 (1979). 
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comparison of Eqs. (10.3.16) and (10.3.23) yields the relation 
r(q) = 16e(tg) = —8nr (q) , (10.3.26) 


where, with the conventional definition of r, r(q) = 4|Dj [RE . This rela- 
tion among measurable quantities is known in the literature as the slow-roll 
consistency condition. For any potential other than the exponential poten- 
tial the scalar/tensor ratio r depends on q. 

For inflation with a single inflaton field, the relation H = —4r Go? tells 
us that e(t) is always positive, but (t) can have either sign, so in general 
Eq. (10.3.25) gives nr (q) < 0, while ns(q) can be greater or less than unity. 
Nevertheless, experience with many models shows® that physically plausible 
potentials that are not finely tuned tend to have ns(q) less than unity, and 
even less than 0.98. But for slow-roll inflation, € and 6 are small, so ns(q) 
cannot be very much less than unity. Thus the general picture of slow-roll 
inflation received some support from the third-year WMAP result quoted 
in Section 7.2, that ns = 0.958 + 0.016. 

As we saw in Chapter 7, the quantities PID and PIRI? provide a 
measure of the contribution of tensor and scalar fluctuations to the multi- 
pole coefficients Crr¢ in the angular distribution of the cosmic microwave 
background temperature. From Eq. (10.3.26) we can see that the tensor 
modes are likely to contribute much less to the Crr e than the scalar modes. 
Also, Eq. (10.3.16) and the fact that anisotropies in the cosmic microwave 
background temperature are small but not too small to be observed indi- 
cates that the Hubble constant during the slow-roll era must be small com- 
pared with the Planck mass 1 / JG, but not too small. In the slow-roll limit, 
where v ~ 3/2, we can write IRI? = |N|?q~3, and Eq. (10.3.16) shows 
that 


ne 16r GH?.., _ (82 O) exit 
6473 l€exit| 9673 lEexit| , 


where the subscript “exit” denotes the time of horizon exit, and in accor- 
dance with the slow-roll approximation we here ignore the weak dependence 
of this time on q. As we saw in Section 7.2, the factor |N]? has the value 
(1.93 + 0.12) x 10-!9, so 


Pexit L 16.70 +0.10) x 10!° Gevj*. (10.3.27) 


l€exit | 


6M. B. Hoffman and M. S. Turner, Phys. Rev D 64, 023506 (2001); W. H. Kinney, Phys. Rev. D 66, 
083508 (2002); H. V. Peiris et al., Astrophys. J. Suppl. Ser. 1148, 213 (2003); G. Efstathiou and K. J. 
Mack, J. Cosm. Astropart. Phys. 05, 008 (2005); L. A. Boyle, P. J. Steinhardt, and N. Turok, Phys. Rev. 
Lett. 96, 111301 (2006). 
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Measurements of cosmic microwave background anisotropies have so far 
been sensitive only to the spectral index of the scalar rather than the tensor 
modes, and therefore have yielded information only on 6 + 2e, not €, but 
unless there is a cancelation between 6 and 2e, these measurements suggest 
that |e] is probably not much greater than a few percent. If for instance 
we take |élexit = 0.05, then exit ~ [3.2 x 10!° GeV]*. In any case, we 
now see that in inflationary theories, the smallness of cosmic fluctuations 
before horizon re-entry is simply a reflection of the fact that, for reasons 
that are still mysterious, the energy scale defined by the energy density of 
the universe at horizon exit is a few orders of magnitude less than the Planck 
energy scale, (87 G)~!/2 = 2.4 x 10!8 GeV. 

This mystery is strongly reminiscent of another mystery encountered in 
elementary particle physics: the unification energy scale, where the three 
coupling constants of the electroweak and strong interactions all come 
together,’ is about 2 x 10!° GeV, also a few orders of magnitude less than 
the Planck energy scale (82 G)~'/*. Perhaps they are the same mystery. 

The measured values of ns and |N| and the observational upper limit 
on the tensor/scalar ratio r already allow us to put useful constraints on the 
inflaton potential. We saw in the previous section that this data is close to 
ruling out any potential V(@) with an exponential dependence on ø for the 
values that g(t) takes around the time of horizon exit. To go further, it is 
useful first to express € and ô in terms of the potential. Using the general 
relation (4.2.3) 


H =—4n GÖ? 3 
and the slow-roll formula (4.2.8) 
vo ___ vø 
3H 24n GV (9) ” 
gives 
1 (eo) 
E(t) = (10.3.28) 
16zG v(oo) 
Also, the time-derivative of Eq. (4.2.3) gives H = —82G@@, and using 
Eq. (4.2.12) then gives 
1 (veao) 2V"(G0) 
ôl) = = (10.3.29) 


lorG \ 2g) rào) 


7See e.g. QTF, Vol. II, Sec. 28.2. 
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For instance, for a power-law potential V (9) « g“, we have 


a? 2a — a? 
e(t) = 161 G@2(t) > ô(t) = l6nGeep 5 (10.3.30) 
so ô(t) = (2/a — 1)e(t), and therefore 
A oe oe CL (10.3.31) 
8a 


The experimental bound on ns depends on the value assumed for r, so 
observations define an allowed region in the ns-r plane.’ At present, the 
straight line (10.3.31) intersects the (68% confidence level) allowed region 
for all positive a, even when WMAP three year data is combined with 
data from the CBI and VSA microwave backgrounds, or from the Sloan 
or 2dF sky surveys. However, low values of œ are favored, and even a 
modest shrinking of the allowed area would rule out high values 
of a. 

To go further, we need to say something about the value of the scalar field 
at horizon exit. For this purpose, we can make use of the relation (4.2.14), 
which gives the number AN of e-foldings when the scalar field goes from 
gı to y2, under the assumption that the slow-roll approximation holds over 


this period, as 
p2 
a=- f (SA) ae. (10.3.32) 
o V'(@) 


1 


If |V’/V| ~ v 16x Ge is essentially constant over the range of g from g; to 
g2, then the number of e-foldings associated with this change in ¢ is 


AN = Agy 4r Gje (10.3.33) 


Lyth? has used this relation in the case where gı and g2 are the field values 
at horizon exit for wave numbers corresponding to £ ~ 1 and £ ~ 100, 
for which AN = In 100 = 4.6, to show that if e is large enough to give a 
detectable tensor mode, then the scalar field must change by an amount that 
is at least as large as the Planck scale 1/./47G. 

If we make the strong assumption that the slow roll approximation holds 
over the whole era from horizon exit to the end of inflation, but do not now 
assume that € is necessarily constant through this era, then for a power-law 


8See Figure 14 of D. Spergel et al., Astrophys. J. Suppl. Ser. 170, 377 (2007) [astro-ph/0603449]. 

°D. H. Lyth, Phys. Rev. Lett. 78, 1861 (1997) [hep-ph/9606387]. Also see G. Efstathiou and K. J. 
Mack, J. Cosm. & Astropart. Phys. 05, 008 (2005) [astro-ph/0503360]; R. Easther, W. H. Kinney, and 
B. A. Powell, J. Cosm. Astropart. Phys. 08, 004 (2006) [astro-ph/0601276]. 
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potential V(g) x g*, Eq. (10.3.32) gives the number of e-foldings from 
horizon exit for a wave number q to the end of inflation as 


4a G 
NO = SPa) - Plena) | (10.3.34) 


where t4 and feng are the times of horizon exit and the end of inflation, 
respectively. If we further assume that |Ø (tena)| < |(t,)|, then 


-2,7 AN (Q) 

Pte) ~ Fae (10.3.35) 
so Eq. (10.3.30) gives 

a =g 

Et) ~ Tig)’ Co) ~ ag’ 
and therefore!? 
a+2 4a 
ns(q) == ING’ r(q) = Ko (10.3.36) 


We noted in Section 10.1 that if the energy density at the beginning of 
the radiation-dominated era is the Planck density G~*, then N (q) ~ 68 
for wave numbers that are just coming into the horizon at the present, and 
correspondingly less for larger wave numbers; for instance, for wave numbers 
corresponding to £ = 100, M would be less by an amount In 100 = 4.6. To 
derive a better estimate of M, we can use Eq. (10.3.27). Under the risky 
assumption that the energy density pı at the beginning of the radiation- 
dominated era is the same as at horizon exit, and taking € = O(.02), we 
have p1 ~ [2.5 x 10! GeV]!/4, so Eq. (10.1.43) with h = 0.7 shows that 
for a wave number that just enters the horizon at the present, M ~ 62, 
while for the wave number corresponding to £ ~ 100, M ~ 57. Taking 
N = 60, for a quadratic potential Eq. (10.3.36) gives ns = .97 and r = 
0.13, which is consistent with the WMAP third-year results, while for a 
quartic potential Eq. (10.3.35) gives ng = 0.95 andr = 0.26, which is barely 
outside the range allowed by WMAP!! Any a > 4 is ruled out. But this 
conclusion is contingent on the assumption of a slow-roll inflation from the 
time of horizon exit until the end of inflation, with V(¢(t)) « Ø% (t) over this 
whole period. 


10D, H. Lyth and A. Riotto, Phys. Rep. 314, 1 (1999). 
Ip, Spergel, ref. 8. 
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10.4 Multifield inflation 


Observations of the cosmic microwave background and large scale structure 
indicate that the primordial scalar fluctuations outside the horizon are 


e nearly Gaussian, 
e adiabatic, 


* nearly scale invariant, in the sense that Rg is nearly proportional to 
-3/2 
q > 


* weak, in the sense that q7/?R? « 1. (We saw in Section 7.2 that q? R8 
is of order 1075.) 


We have seen in Sections 10.1 and 10.3 that these properties of primordial 
scalar fluctuations follow under the assumptions that 


1. The energy density during inflation receives appreciable contributions 
from just a single real “inflaton” scalar field. This implies that the 
fluctuations are adiabatic during inflation, in which case they remain 
so thereafter. 


2. During the era of horizon exit (say, for q/aH falling from 10 to 0.1), H 
is sufficiently small so that q/a is less than whatever fundamental scale 
(such as the grand unification scale or the Planck scale) characterizes 
the theory, not only during this era but for some time before it. (For 
the case considered in Section 10.1, we estimated that H ~ 10!4 GeV, 
which is probably small enough.) This implies that during this era the 
scalar field is described by a simple effective action, involving no more 
than two spacetime derivatives. It follows that for some time before 
the era of horizon exit the inflaton behaves like a free field, so that the 
fluctuations are Gaussian. 


3. For observed fluctuations, in the era of horizon exit inflation is “slow- 
roll,” in the sense that \H|/H* « 1 and|H/HH| « 1. Together with 
assumptions | and 2, this implies that the fluctuations are nearly scale 
invariant. 


But there is no particular reason to believe that the energy density during 
inflation is dominated by a single scalar field, so we are naturally led to 
consider the case of several inflaton fields ø” (x). We shall show that the same 
properties of primordial scalar fluctuations follow if we make assumptions 
2 and 3, but replace assumption 1 with 
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1*. The energy in all the scalar fields is converted at the end of inflation into 
ordinary matter and radiation in local thermal equilibrium, in which all 
chemical potentials vanish. (Baryon and lepton number would then 
have to be generated later, as discussed in Section 3.3.) 


Whatever the number of scalar fields, under assumption 2, the effective 
action of the scalar fields is dominated by terms with a minimum number of 
spacetime derivatives during and after the era of horizon exit and for some 
time before it. For arbitrary numbers of scalar fields, the most general such 
action takes the form 


1 19 
yal DE [-5e" RC ye -v| (10.4.1) 


Ox! ax” 


where V (ọ) is an arbitrary real potential, repeated scalar field indices are 
summed, and Ynm(ọ) is an arbitrary real symmetric positive-definite matrix, 
which we shall call the field metric. (This matrix must be positive-definite to 
give the right sign to commutators of fields and their time derivatives.) The 
energy-momentum tensor, which serves as the source of the gravitational 
field, is derived as described in Appendix B from the action (10.4.1), and 
takes the form 

ag" 


Tys EAP vam) A 
uv = Suv aS Yam\?) 9 5 


ax ax” 


= 


. (10.4.2) 


7 - vo) + Ynm(Q) —— 


The scalar field equations are derived from the principle that J, must be 
stationary with respect to infinitesimal variations in the the scalar fields, 
and take the Euler-Lagrange form 


0 n 
soa (V=Dae ymo e) = /=Dete 


(; uv Yml) Og! ag” ro) 
x ., 


10.4.3 
2° ag" axe ax” ag” euka) 


We take each scalar field y”(x) as an unperturbed term ø”(t) that 
depends only on time, plus a small perturbation ôg” (x, t): 


P” (x,t) = P” (t) + 8g” (x, t) . (10.4.4) 


Similarly, as in Chapters 5-8, the metric is assumed to be given by the 
unperturbed Robertson-Walker metric 2,» (t) (with K = 0) plus a small 
perturbation Auv (x, t) 


Ew, t) = Zw (t) + hy (, t). (10.4.5) 
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The unperturbed energy-momentum tensor is of the perfect fluid 
form (5.1.35), with unperturbed energy density, pressure, and velocity 


=æ 1 ia =. = 
p= 5 Ynm(P) erp +V@), (10.4.6) 
aa 1 pee es ay. 
p= 5 Yam (Peg —V@), (10.4.7) 
wm=1, w=0. (10.4.8) 


The scalar field equation (10.4.3) for the unperturbed fields is 


3V (p) 
og” 


p” +y" (Pog! + 3HE" + y""@) =0, (10.4.9) 


where y™” is the reciprocal of the matrix ym, y; is the affine connection 


in field space: 


ng) = lyko (22 IVl) a) 
ml > 


10.4.10 
ag! ag” agk 
and H is the expansion rate H = åà/a = ./81Gp/3. The reader may 
check that Eq. (10.4.9) guarantees that the energy-conservation equation 
p = —3H (p + p) is satisfied by the quantities (10.4.6) and (10.4.7). From 
Eqs. (10.4.6) and (10.4.9) we find the convenient formula 


H = —41 Gypm(@)G" 0" . (10.4.11) 


For more than one scalar field T,» is not of the perfect fluid form 
(5.1.35) to all orders in perturbations, but by comparing the first-order 
terms in Eq. (10.4.2) with Eqs. (5.1.39)-(5.1.41), we see that to first order 
the anisotropic inertia vanishes, and the perturbations to the energy density, 
pressure, and velocity potential are 


g” g” Yam (P) 


5) = yam(G)G" 8G” + = 


av agk 
aV (p) 1 ee 
gg aa ie eg (10.4.12) 
l. Ynn (P) 
5P = Vam (P) SH” + 59"0 g'g” r 
ƏV (p) 1 eee 
Da 8g" + =hooYnm(@)G"@" , (10.4.13) 
IP 2 
= Sng, m 
bu = ZOTE. (10.4.14) 
VEUP)P* P 
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The reader can easily check that these formulas reduce to Eqs. (10.1.9)— 
(10.1.11) in the single-field case, with yj; = 1. 

Since there is no first-order anisotropic inertia, in Newtonian gauge we 
have ® = W, so hoo = —2W, and the Einstein field equation Eq. (5.3.21) 
takes the form 

W + HY = 4r G yam(G)G" 50" , (10.4.15) 


The terms of first order in the field equation Eq. (10.4.3) are much simpli- 

fied if we now adopt a notation that reveals the transformation of quantities 

under redefinitions g” — g’"() of the scalar fields. Under such transfor- 

mations, quantities like g” and ôg” transform as contravariant vectors, in 

the sense that of E 

so” =P am, groz gn (10.4.16) 

dg” 

For any vector v” that transforms in this way, we can define a rate of change 
that is also a vector: 

D 


ð 
we u” =e —vy" + yf (gu! u” (10.4.17) 


With this notation, the first-order terms in the field equation (10.4.3) give 


D? D V () v? 
=} 3H De nm 5 5 
DE g” + o” +y oA a ag! g' - a p" 
ro 


_2ym@gyw À +40 P+ y"me@ posp, (10.4.18) 


where y” mk (Ọ) is the Riemann-Christoffel tensor in field space: 


avr (p) E dy.) 
agk ag! 


V"imk(P) = + Vin PVP) — YP) Ymr (P) - (10.4.19) 


(The scalar fields can be redefined to make yym = ônm if and only if y” mk = 


0. We are not assuming that this is the case.) Also, the constraint (5.3.26) 
is here 


a ve = +n D m m D +n 
(1 - =) Y= 4x Gynn(@)( =O" + 89" =o ) - (0.4.20) 


The solutions are written as superpositions of plane waves 
ôg” (x, t) = 
2 J d'a] [pfe “alq, N) + PN, (Det Xa*(q, N)| (10.4.21) 
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V(x,f) = 
J dq [Yn Dea, N) + VR, Oea", N) |. (10.4.22) 


Here N labels different solutions of the coupled equations (10.4.15)— 
(10.4.17), with V? replaced with —g: 


Wg + H Yng = 4G Yum)" Pq » (10.4.23) 
De. D avg 7, [È 
pn? o” Haan 3H 52a + y” (Ọ) agm ag! ôP] + J2 õp 
= dV (p) © = ae. eps 
= 2y) Va og FAV OF Ym) ọso, (10.4.24) 


2 
; T = D nD D >n 
(i + 2) Y = 4x Gynm(@)( — p" Di 5g, + 5g,’ — pe? o"). (10.4.25) 


There is one second-order equation for each scalar field, and one first- 
order equation for Y, so with one constraint on first derivatives the number 
of independent solutions equals twice the number of scalar fields. Since 
(pN g Wnq) and (pN, g Wy g) are all independent solutions, the index N 
takes as many values as the index n. 

To find the initial conditions for Eqs. (10.4.23)—(10.4.25), and to find the 
commutation relations for the operators œ (q) and a* (q) in Eqs. (10.4.21) and 
(19.4.22), we note that for some time before the era of horizon exit we will 
have q/a >> H, and q? /a? much greater than any element of 83? V /3g” 3g”. 
Hence the solutions up to the beginning of the era of horizon exit take the 


WKB form 
Nq Nq p q ” C) ’ 


t dt! 
Ww —i ; 10.4.26 
Nq(t) > gNgq(t) exp ( iq f =) ( ) 


where f. Ng!) and gnq(t) vary much more slowly than the argument of the 
exponential, and ¢, is arbitrary. Eqs. (10.4.23) and (10.4.25) are both satis- 
fied to leading order in q/a if we take 


4in Ga 


ENa = Yum DGSE, « (10.4.27) 
The terms in Eq. (10.4.24) of first order in g/a then give 


D 
pila + Hfi = 9 - (10.4.28) 
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To solve this, we note that, because y™” (a0) is positive-definite, it can be 


written in terms of a set of vielbein vectors eù (£) (with N running over as 
many values as n), as 


yn (a0) =F AORA) . (10.4.29) 
N 
These vielbeins can be defined to satisfy the equation of parallel transport! 
D 
pM =0, (10.4.30) 
so the solution of Eq. (10.4.28) is 
fig) «a7! (Den) . (10.4.31) 


For reasons that will soon become apparent, we shall normalize these sol- 
utions so that, for g/a > H, 


Sig = On) P Qq) aT eh, . 


With this normalization, at the beginning of the era of horizon exit 
we have? 


t 7 


1 dt 
IOA = i —i —], 10.4.32 
Png) Or aO ™ (t) exp ( lq [ a) ( ) 


Ain Gynm (pen (t)O" (0) ( f £) 
y = — . (10.4.33 
nq (t) Ons? Jaq td eae eres ( ) 


For times early enough so that g/a >> H the commutation relations for g” 
can be obtained from the action (10.4.1) with Y neglected, and therefore 


' We define the vectors en, i (£) to satisfy the first-order differential equation (10.4.30), and, as an initial 


condition, to satisfy Eq. (10.4.29) at some initial time £ = tį. From Eq. (10.4.30) and the definition 
(10.4.10) it follows that for all times 


k k >l 
Din T [-v Dk, T vf Di Po 


where 
Di = S ee ven ; 
N 

This differential equation for D}, has a solution D/,, = 5/;,, and our initial condition tells us that Di, = ôh, 
at t = t4, so this is the solution for all times. It follows that the vectors en, q® defined in this way satisfy 
the condition (10.4.29) for vielbeins for all times. 

2A result equivalent to Eq. (10.4.32) is given in Eq. (4.4) of H.-C. Lee, M. Sasaki, E. D. Stewart, 
T. Tanaka, and S. Yokoyama, J. Cosm. & Astropart. Phys. 0510, 004 (2005) [astro-ph/0506262], using 
a “SN” formalism due to M. Sasaki and E. D. Stewart, Prog. Theor. Phys. 95, 71 (1996) [astro- 
ph/9507001]. But their paper does not reach the conclusion (10.4.41) found in this section. 
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take the form: 
lyp", D, p” y, 01 =0, 
lo", 1), 6"(y, D= aO y™"(G()) 3K y), 


With dy” normalized as in Eq. (10.4.32), this implies that the time- 
independent operator coefficients in Eqs. (10.4.21) and (10.4.22) satisfy the 
commutation relations 


[ea N, aq, N] =0, [ag.n), a*a, N] = 8a- anw - 
(10.4.34) 


Assuming that there is enough time before horizon exit for the state of the 
world to decay into the Bunch—Davies vacuum |0), with æ (q, N)|0) = 0, it 
follows then from Eqs. (10.4.21) and (10.4.22) that the observed perturb- 
ations will be Gaussian, just as in the single-field case. 

According to assumption 3, during the era when q/a drops from being 
somewhat larger to somewhat smaller than H — say, from 10H to 0.14, 
the scalar fields are rolling slowly down the potential hill. We assume that 
V (ọ) satisfies whatever flatness conditions are necessary to allow us to drop 
all terms in Eqs. (10.4.23)-(10.4.25) proportional to gn or gn and to ignore 
the second derivative of the potential in Eq. (10.4.24). Then during the era 
of horizon exit Eq. (10.4.24) is approximately 


2 
$ ; q 
5GNq + 3H5GNq + (5) png = 9. (10.4.35) 
H is roughly constant during this era,* so the independent solutions of 
Eq. (10.4.35) are proportional to (1 + igt)exp(—igt) and its complex 
conjugate, where t is again the conformal time 


[ dt’ 1 

T= S . 

œ A(t") Ha(t) 

The scalar field perturbations at the beginning of the era of horizon exit are 
given by Eq. (10.4.32), so during this era we have 


1 1 iH\ ,, 
by", ~ ( zs jreg , 10.4.36 
PNq (2)3/2,/2q a q N ( ) 


3The fractional change in H during the era of horizon exit is |Ħ/H| x In(100)/H, which is small if 
|H|/H2 « 1/1n(100) = 0.22. 
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By the end of the era of horizon exit we have g/a « H, and the scalar field 
perturbations approach the quantities 
iH 
L (10.4.37) 


(27)3/2 /2q? 


Since H and eù are slowly varying, they can be evaluated at the time of 
horizon exit, and depend weakly on q. 

At some time after the era of horizon exit the slow-roll conditions must 
become violated, if only in order that the energy in the inflaton fields can 
eventually be converted into ordinary matter and radiation. The potential 
term on the left-hand side of Eq. (10.4.24) is then no longer negligible, and 
things get complicated. But once g/a becomes much less than H, the subse- 
quent evolution of the scalar fields during inflation cannot depend on q, so 
until horizon re-entry all scalar field perturbations 55, will have the same 


wave number dependence, close to g~>/?, as given by Eq. (10.4.37) at the end 
of the era of horizon exit. The same applies to the scalar metric perturba- 
tions; when a becomes sufficiently large so that g?/a? « |H]|, Eq. (10.4.25) 
gives Vy, approximately proportional to q73/⁄?. 

In general, it is not easy to see what these results imply for the perturb- 
ations observed in the cosmic microwave background or large scale struc- 
ture. However, there is one case where an important conclusion can be 
reached. If according to assumption 1* the energy in all the scalar fields is 
converted at the end of inflation into ordinary matter and radiation in local 
thermal equilibrium, and if at this time all conserved quantities like electric 
charge have zero density, then as remarked in Section 5.4, the perturbations 
become adiabatic, with R, taking a constant value RI until horizon re-entry. 
For small fluctuations, R4 will be some linear combination of the perturba- 
tions ôg% j and Wy, at the end of the era of horizon exit. We do not know the 
coefficients in this linear combination, which in general will depend on the 
shape of the potential experienced as the field evolves until the end of infl- 
ation, as well as on the mechanism of energy transfer to matter and radi- 
ation. But we can be sure that these coefficients are independent of wave 
number, because once the era of horizon exit is over the perturbations are 
far outside the horizon. Hence we can conclude that in this case R will 
have the same wave number dependence as dg, j and Wy, at the end of the 
era of horizon exit, which for slow-roll inflation will be close to g~>/?. 

The ubiquity of the g~?/? wavelength dependence can be understood 
on very general grounds. For negligible spatial curvature, nothing should 
depend on how the co-moving coordinate vector x is normalized, so suppose 
we change its scale by a transformation x > Ax, with à constant. To keep 
q: x unchanged, we must then change the scale of co-moving wave numbers 
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by a transformation q —> A~!q. Consider any perturbation Z(x, £) that, 
like 6g”(x, t) and W(x, t), is given by a Fourier integral 


Z(x,1) =~ J dq [eal N)zng() + ea" Q, Mz] 
N 


where N labels the various solutions of the field equations, and a(q, N) and 
a* (q, N) are annihilation and creation operators satisfying the commutation 
relations 
[ala, N), a*(q’.N’)] = Sy? (q — q’) . 

Under the transformation q — A~!q, the delta functionin this commutation 
relation transforms as 5°(q—q’) —> A*53(q—q’), so we must have a(q, N) > 
3/2a(q, N). Also, of course d?q —> 4~3d3q. Hence in order for Z(x, t) to 
be unaffected by this change of scale, we must have z(t) > A°/?zyg(t). 
This condition is satisfied if z(t) has a g-dependence œ q7?/?, andif it does 
not depend on the scale of a(t). Of course, to keep the physical coordinate 
vector xa(t) independent of the normalization chosen for the co-moving 
coordinates, a(t) has the scale transformation a(t) —> A~!a(t), so g/a(t) 
is scale-invariant, and if zy,_(t) depends on the scale of the function a(t) 
then we cannot conclude that it is proportional to g~7/*. But if ZNq(t) 
like Pig D and Wy,(t), takes a nearly time-independent value zġ, after 
horizon exit, then outside the horizon it will not depend strongly on a(t). It 
could still depend on H(t,), H (tq), etc., where fg is the time of horizon exit, 
defined by the scale-invariant condition q¢/a(t,) = H(t,), but in the limit of 
very slow roll inflation H, H, ete. depend only weakly on time, so ZNq can 
depend only weakly on the scale of the function a, and the scale-invariance 
of Z (x, t) requires that ZNq be nearly proportional to g~*/*. 

The intensity of observed adiabatic fluctuations is related to the quantity 
R defined by Eq. (5.4.1). Using Eqs. (10.4.14), (10.4.11), (10.4.21), and 
(10.4.22), during inflation this is 


R(x, ft) = —W (x, t) + H (t) bux, t) 


= - W(x, 1) + em (a) a" Osp", 
-5 J dq etxat, MRNA + eixa CA, NRO], 
À (10.4.38) 
where 
4r GH (t) 


Roli) = —Wyq(0) + Yam (G()) OSORNO}. (10.4.39) 


H(t) 
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Because of the factor H in the denominator, at the end of the era of horizon 
exit the second term in Eq. (10.4.39) dominates over the term —Wy,, and 
Eq. (10.4.37) then gives at this time 


_ 4nGH? 
lea —— V 
OSPA 


The definition (10.4.29) of the vielbeins and the formula (10.4.11) then lead 
immediately to a sum rule 


1/2 5 
E Peal PE a (10.4.41) 
N 21 q3/*./|H| 


That is, the root-mean-square value of the quantities IR Nq| at the end of the 
era of horizon exit is the same as the value of |R,| outside the horizon in 
the single-field case, given by Eq. (10.3.16). But Eqs. (10.4.38) and (10.4.34) 
tell us that the correlation function of R in the Bunch—Davies vacuum is 


Ryg > (g)pren . (10.4.40) 


J dixe ™® OI (R(x, 1) RY, D) = 2r) Y [Roa] (10.4.42) 
N 


So from Eq. (10.4.41) it follows that the correlation function (R(x, t) R(y, t)) 
is the same at the end of the era of horizon exit as in the single field case. With 
more than one scalar field the correlation function (10.4.42) is not in general 
time-independent outside the horizon, but it is plausible that the value of 
|Rg| during a period of thermal equilibrium after inflation will not be orders 
of magnitude different from Eq. (10.4.41) at the end of the era of horizon 
exit. Thus the observed strength and spectral shape of anisotropies in the 
cosmic microwave background suggests a value of H at horizon exit of order 
10!4 GeV, as in the single field case. 

One can derive stronger results in the case where all but one of the eigen- 
values of the matrix y”” (6) 32V (G)/d@"d¢@! in Eq. (10.4.23) are large and 
positive. In this case, the unperturbed scalar fields roll along the direction 
of the eigenvector for the small eigenvalue, and the only significant perturb- 
ations lie in that direction. The problem then reduces to the single field 
case; we have an essentially adiabatic perturbation, with the amplitude R4 
given in the slow-roll approximation by Eq. (10.3.16), and the slope ns(q) 
given in this approximation by Eq. (10.3.25), where €e and 6 are to be calc- 
ulated in terms of the time-derivatives of the expansion rate by Eq. (10.3.4). 
But there is no known reason why the potential should have the properties 
needed to justify these results. On the other hand, tensor perturbations 
during inflation are governed by Eq. (10.3.17) however many scalar fields 
there are, and the only role played by the scalar fields is to contribute to the 
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Hubble rate H. Thus for slow-roll inflation with any number of scalar fields, 
the tensor amplitude D4 is given by Eq. (10.3.23), and the slope parameter 
nr(q) is given by Eq. (10.3.25), just as in the single field case. 

The fact that observations of cosmic microwave background anisotropies 
and large scale structure indicate that scalar fluctuations outside the horizon 
are adiabatic and Gaussian, with 7 approximately proportional to gr. 
and with q?/ RE <« 1, evidently is consistent with a very large class of 
models of inflation. This is encouraging, because it supports the general 
idea of slow-roll inflation, but also disappointing, because it shows that 
these observations so far do not really tell us anything specific about the 
details of inflation. With further improvements in experimental precision, 
we can look forward to a more decisive test of theories of the early universe. 
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Appendix A 


Some Useful Numbers 


Numerical Constants 


z = 3.1415927 1” = 4.84814 x 10~° radians 
e = 2.7182818 In 10 = 2.3025851 
y = 0.5772157 (3) = 1.2020569 

Physical Constants! 


Speed of light in vacuum c = 2.99792458 x 10!° cm sec7! 
Planck constant h = 6.6260693(11) x 1077 erg-sec 
Reduced Planck constant A = h/2m = 1.05457168(18) x 1077 erg sec 
= 6.58211915(56) x 107? MeV sec 
Electronic charge (unrat.) e = 4.80320441(41) x 107! esu 
Electron volt 1 eV= 1.60217653(14) x 107!” erg 
hc = 197.326968(17) x 10713 MeV cm 
Fine structure constant a =e?/fic= 1/137.03599911(46) 


Electron mass Me = 9.1093826(16) x 10-78 g 
mec? = 0.510998918(44) MeV 
Rydberg energy hcR = mee*/2h* = 13.6056923(12) eV 
Thomson cross section oT = Bret /3m2c4 = 0.665245873(13) x 
10-24 cm? 
Proton mass mp = 1.67262171(29) x 107% g 
mpc? = 938.272029(80) MeV 
Neutron mass mpc? = 939.565360(81) MeV 
Deuteron mass mac? = 1875.61282(16) MeV 
Atomic mass unit m(C!2)/12 = 1.66053886(28) x 10-74 g 
m(C!?)c?/12 = 931.494043(80) MeV 
Avogadro’s number N4 = 6.0221415(10) x 1073/mole 
Boltzmann constant kg = 1.3806505(24) x 10716 erg/K 
= 8.617343(15) x 1075 eV/K 
Radiation energy constant ag = Hi = 7.56577(5) x 107! erg cm~? 
—4 


l From Review of Particle Physics, S. Eidelman et al. (Particle Data Group), Phys. Lett. B 592, 1 
(2004). 
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Weak coupling constant Gwg = 1.16637(1) x 1075 Gev~? 
Gravitational constant G = 6.6742(10) x 1078 dyn cm? g7? 
Planck energy J/hc/G = 1.22090(9) x 10!9 GeV 


Astronomical Constants? 


Julian year l year = 365.25 days = 3.1557600 x 107 sec 

Light year 1 light (Julian) year = 9.460730472 x 10!7 cm 

Mean earth-sun distance 1 A.U. = 1.4959787066 x 10!3 cm 

Parsec 1 pe = 648000/z A.U. = 3.0856776 x 10!8 cm 
= 3.2615638 light (Julian) year 

Solar mass Mo = 1.9891 x 1073 g 

Solar luminosity Lo = 3.845(8) x 10? erg sec! 


Apparent luminosity for apparent magnitude m 

L = 2.52 x 10-° erg em~? sec! x 1077"/> 
Absolute luminosity for absolute magnitude M 

L = 3.02 x 1035 erg sec! x 10-2”/5 
For a Hubble constant Hp = h x 100 km sec7! Mpc7!: 


Hubble time Hy ' = 3.085747! x 10!” sec = 9.778 h`! x 
10° years 
Hubble distance c/ Ho = 2997.92458 h-! Mpc 
2 
Critical density Perit = in = 1.878 h? x 107? g cm~? 
= [0.00300 eV]* h? 


2From Allen's Astrophysical Quantities, ed. A. N. Cox (AIP Press, New York, 2000). 
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Review of General Relativity 


In this appendix we offer a brief introduction to the General Theory of 
Relativity, Einstein’s theory of gravitation. This appendix is not a substitute 
for a thorough treatment of the theory, but it outlines the parts of the theory 
that are used in this book, and it serves to establish our notation. 


1 The Equivalence Principle 


General Relativity is based on the Principle of the Equivalence of 
Gravitation and Inertia, or the Equivalence Principle for short. The Equiv- 
alence Principle is a generalization of the familiar observation that, because 
of the equality of gravitational and inertial mass, freely falling observers do 
not feel the effects of gravitation. According to the Equivalence Principle, 
at any spacetime point in an arbitrary gravitational field there is a “locally 
inertial” coordinate system in which the effects of gravitation are absent in 
a sufficiently small spacetime neighborhood of that point. This Principle 
allows us to write the equations governing any sufficiently small physical 
system in a gravitational field if we know the equations governing it in the 
absence of gravitation: it is only necessary to write the equations in a form 
which is generally covariant — that is, whose form is independent of the 
spacetime coordinates used — and which reduce to the correct equations 
in the absence of gravitation. Such equations will be true in the presence 
of a gravitational field, because general covariance guarantees that they are 
true in any set of coordinates if they are true in any other set of coordinates, 
and the Equivalence Principle tells us that there is a set of coordinates in 
which the equations are true — the set of coordinates that is locally inertial 
at the spacetime location of the system in question. In general there will 
be more than one set of generally covariant equations that reduce to the 
correct equations in the absence of gravitation, but the differences between 
these equations always involve terms with extra spacetime derivatives, which 
become negligible if we restrict ourselves to a spacetime region that is small 
compared with the scale of distances and times over which the gravitational 
and other fields vary appreciably. 


2 The metric: Ticking clocks 


As an example of this procedure, consider the equation that governs the rate 
at which clocks tick in a gravitational field. Special Relativity tells us that if 
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a Clock ticks once in every time interval dT when at rest in the absence of a 
gravitational field, then the separation dé“ between the spacetime locations 
of successive ticks when the clock is moving in the absence of a gravitational 
field is governed by the relation 


nop dE”dE? = —dT? . (B.1) 


(Here é!, €*, and £? are the Cartesian space coordinates, using units of 
length in which the speed of light c is unity; €° = t; Nap is the Minkowski 
metric, the diagonal matrix with 71; = n22 = 733 = 1 and noo = —1; and 
repeated indices are summed.) The correct equation governing the ticking 
of the clock in a general gravitational field is then 


Syv(x)dx"dx” = —dT? , (B.2) 


where g,, (x) is the metric, a field defined by the two properties that, first, a 
transformation to a coordinate system x’" changes the metric to 


ax ox? 

ax!? Ix ’ 
and, second, in coordinates that are locally inertial and Cartesian at a point 
x, the metric at x is nag and its first derivatives at x vanish. (In Eq. (B.3), 
x#and x’ are the coordinates of the same physical point in two different 


coordinate systems.) Eq. (B.2) is generally covariant, because the coordinate 
differentials have the obvious transformation property 


Bq (X) = Suv (x) S (B.3) 


p 
dx? = — dx" , (B.4) 


so that 

ax! ax” ax’? at ax’? 
ax ax’? axk ax’ 
To repeat our general argument, Eq. (B.2) is true, because in locally inertial 
Cartesian coordinates it reduces to the equation (B.1) that describes clocks 
in the absence of gravitation, and its general covariance means that if it is 
true in one set of coordinates then it is true in any other set of coordinates. 
In the same way, the spacetime separation dx” of the ends of a small ruler 
whose length is dL when measured at rest in the absence of gravitation will 
in general be given by 


gho (x) dx" dx!” = gw) — 


Zu (x) ae dx” = dE? . 


Likewise, the differences dx” between the spacetime coordinates of two 
successive positions along a ray of light are governed by the equation 


Zuv(x)dx”“dx” =0. 
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3 Tensors, vectors, scalars 


Quantities that transform as in (B.3) and (B.4) are known as covariant ten- 
sors and contravariant vectors, respectively. In general, contravariant and 
covariant quantities are labeled with upper and lower indices, respectively, 
and for each such index there is a factor in the transformation rule of x’ /dx 
or dx/dx’, respectively. It is also possible to have mixed quantities, with 
some upper and some lower indices. For instance, there is a mixed tensor 
6", defined in any coordinate system by 


tag 2 , (B.5) 


Even though its components are the same in all coordinate systems this is 
a tensor because 

pX Ox” 8x? xe _ a5 

P axl xI axt xI T 


There is also a contravariant tensor g””, defined as the inverse of the metric 
gag, = dt. (B.6) 


To see that this is a tensor, just note that 


ax!" ax! ax’! ax! ax axt 
(ero x) jao = g” (x) Sint (x) 


OxP ax? OxP Ax? ax! ax 
aie p 2X7 əx” axt 
setioa. au ôs 5 ay Snr (x) = 8? —— ae ae ae 


Thus the quantity in parenthesis in the first line is the reciprocal of the 
transformed metric 


ax’! ax” 


po = IUÀ f, 
Poolo ae E N) 


verifying that the reciprocal of the metric is a contravariant tensor. 

A scalar s(x) is a quantity whose value at a physical spacetime point is not 
changed by a coordinate transformation; that is, using a set of 
coordinates x’ it is 


s(x’) = s(x) (B.7) 
The derivative v, = ds/dx" of a scalar s(x) is a covariant vector: 
as’ as ax’ axl 
To= SARN, a 
o = JyP T gan axl? YGP ee) 
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Scalars and vectors may be regarded as tensors with no indices or one index, 
respectively. We can make tensors out of other tensors by taking the direct 
product; for instance, if A“, and Bpo are tensors, then so is the direct 
product Cpo = A”,Bpo. We can also make tensors with fewer indices 
out of other tensors by contracting upper and lower indices, as Eqs. (B.2) 
and (B.6). As a special case, we often lower (or raise) an index on a tensor 
by taking the direct product of the tensor with the metric (or its inverse) 
and then contracting an upper (or lower) index on the tensor with an index 
on the metric (or its inverse). For instance, if A“ is a tensor, then so is 
Ap” = 8y)A"”, while if B,” is a tensor then so is B°” = g°° B”. Note that 
raising and lowering the same index just gives back the original tensor; for 
instance g7? (g,,,A“") = A°”. Any equation that states the equality of two 
tensors of the same type or that a tensor of any type vanishes is generally 
covariant. 


4 The affine connection: Falling bodies 


But not everything is a tensor. For instance, Eq. (B.4) tells us that the first 
derivative of the coordinate x“ of a particle with respect to some scalar 
quantity u that parameterizes position along the particle’s trajectory (such 
as the time on some fixed clock) is a vector 


dx’? B ax’? dx” 


= i B.9 
du əx” du B 
but the second derivative is not a vector 
d?x'’ _d (=) 
du? du \. ax du 
7 ax’? d?x” 32x? dx! dx” (B.10) 


Ox du? Ox ax” du du ` 


This means that the correct generalization of the equation d7é° /du? = 0 for 
the motion of a particle in the absence of gravitation is not d?x#” /du? = 0, 
because this equation is not generally covariant. Instead, to cancel the 
second term on the right-hand side of Eq. (B.10) we must introduce a quan- 
tity ri (x) defined by the transformation property 


jin 8x7 Ix IX 4 Or ORM ax” 


oP ax Ax’? ax’? HY Ax Ax” Ax’? Ax’? ’ 


(B.11) 


and the proviso that ri, œ) vanishes in a coordinate system that is locally 
inertial and Cartesian at x. The correct equation of motion for a particle 
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that is freely falling in a gravitational field is then 


axe a dx” dx” 
— = B.12 
du? W du du Ms (Bela) 


because this is generally covariant, and reduces to the correct equation of 
motion d*&“/du* = 0 in the absence of gravitation. The field TÌ, is known 
as the affine connection, and of course is not a tensor. 

There is a simple formula for the affine connection in terms of the metric 


OZpn , WSpv 9S pw 


ane. | əx” əx” Eus l 


(B.13) 


It is straightforward to check that this is generally covariant, and it is true in 
a locally inertial Cartesian coordinate system because in such a system both 
sides vanish, so it is correct for general gravitational fields and coordinate 
systems. 

A trajectory that satisfies Eq. (B.12) is called a spacetime geodesic, 
because on such a trajectory the integral 


J 2 j gwaa a Day 
uy 


di du 


is stationary under variations that leave x” (u) fixed at the endpoints u1 
and u2. Often instead of specifying the metric we specify the line element 
ds? = Zuvdx”dx” for arbitrary differentials dx”. 

The equation of motion (B.12) is not valid for just any choice of the 
parameter u. To see this, note that Eq. (B.12) implies a conservation law 


d dx” dx” | _ ðgZuv dx” dx” dx” T d?x! dx” re dx” d?x” 
m| du da) ox du du du?" ae du S Gu de 


OS pv dx” dx” dx* 
= > a ae Ta Ti E ER 
( ao CUAN TE a) du du du 


=0. (B.14) 


It follows that u must be a linear function of the proper time t, defined by 


dt = | —8 uv dx’ dx” , (B.15) 


which according to Eq. (B.2) is the time told by a clock that falls freely along 
with the particle. The only exception to this conclusion is for massless parti- 
cles like photons, whose spacetime trajectory satisfies the same equation of 
motion (B.12) as for a massive particle, but for which the conserved quantity 
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dt/du vanishes. For a massless particle or a ray of light we need to choose 
the parameter u as the time told by some other freely falling clock. 

In the case of non-zero mass m, it is convenient to take the affine param- 
eter u as u = T/m, for then we can define the energy-momentum four-vector 
as 


a me _ dx 
Pe du’ 
and, using Eq. (B.15), 
Lup" Pp’ = -m . 


For massless particles we have g,,(dx"/du)(dx”/du) = 0 however u is 
normalized, so we can simply suppose that it is normalized in such a way 
that the energy-momentum four-vector is p” = dx" /du. 


5 Gravitational time dilation 


These results allow us to derive one of the most important consequences of 
the Equivalence Principle. For a slowly moving particle, dx'/du is much less 
than dx°/du, so Eq. (B.12) becomes 


ax ; dx? dx? 
— ——=0. B.16 
du? 0 du du Ca 
For a weak gravitational field, the metric g,,, is nearly equal to the Minkowski 
metric Hy», SO 
Suv = Nu + hwv » (B.17) 
with the components of /,,, much less than unity. We then can take u = 


t ~ x° = t, so the equation of motion of a freely falling slowly moving 
particle in a weak gravitational field is 


dt? 


where i runs over the values 1, 2, 3, labeling spatial directions in a Cartesian 
coordinate system. The affine connection for a weak gravitational field is 


4 Toh ah ah 
Bogda pu pv _ OMpy 
Tav = 2 | Ox” = Axk# xP | l (BIA 


=-Th,, (B.18) 


In particular, for a weak time-independent gravitational field we have 


(B.20) 
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Eqs. (B.18) and (B.20) allow us to identify —hoo/2 as the Newtonian 
gravitational potential ¢, so in a weak static gravitational field we have 


go œ~ -1 —2¢. (B.21) 


Now consider a clock at rest in such a field. According to Eq. (B.2), if the 
time between ticks in the absence of a gravitational field is dT, then in the 
presence of the field it is dt, where 


(=1=20)d? ~ — dT? . 
Hence the time between ticks is no longer dT, but rather 
dt ~ (1 — p)dT . (B.22) 


In the negative gravitational potential at the surface of a star clocks there- 
fore tick more slowly than in interstellar space, or in the much weaker 
gravitational potential at the surface ofthe earth. This could not be observed 
on the star’s surface, since all physical processes would be slowed there by the 
same factor, but it is observed at a distance, by measuring the decrease in the 
frequency of photons emitted from atomic transitions on the star’s surface. 
The gravitational time dilation is measured most accurately by observing 
the shift of spectral lines as photons rise or fall in the earth’s gravitational 
field. 


6 Covariant derivatives 


Although the spacetime derivative of a scalar field is a vector, the derivative 
of a vector or a tensor field is in general not a tensor. For instance, a 
contravariant vector field v” has the transformation property 


ax’? 
Pa gh, (B.23) 
oxe 
so that 
dul de” ax” dwk 8x? ax” 


H, (B.24) 


= v 
ax’? ax! ax’? ax” — AxHAX” Ax’? 


To construct a tensor we must add a term that cancels the second term in 
this transformation law. In this way we are led to introduce a covariant 
derivative 


H dul Mon 
VU sv — ax + Pye 5 (B.25) 
which does transform as a mixed tensor 
ax’? ax” 
vo = r Pe (B.26) 
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Likewise, the covariant derivative of a covariant vector is defined by 


-T v, (B.27) 


and is a covariant tensor. More generally, the covariant derivative of a 
tensor with any number of upper and/or lower indices is given by simi- 
lar formulas, with a +I for every upper index and a —T for every lower 
index. It is easy to check that the covariant derivatives g„v;\ of the metric 
tensor all vanish, as they must, since in a locally inertial Cartesian frame 
the covariant derivative is an ordinary derivative and the first derivatives 
of the metric vanish, so that the covariant derivatives vanish, and a tensor 
Zuv:\ that vanishes in one coordinate system must vanish in all coordinate 
systems. 


7 Effects of gravitation: The Maxwell equations 


Given the equations that govern some set of fields in the absence of grav- 
itation, we can find the equations that apply (at least in sufficiently small 
regions) in a gravitational field by replacing all Minkowski metrics 7 with 
metric tensors and all derivatives with covariant derivatives. As mentioned 
earlier, the procedure does not give a unique result, since there are tensors 
formed from second and higher derivatives of the metric that vanish in the 
absence of gravitation, but the effect of including such tensors in the gen- 
erally covariant field equations would be negligible in a sufficiently small 
spacetime region. 

For instance, in Cartesian coordinates in the absence of gravitation 
electric and magnetic fields are governed by Maxwell’s equations 


Og Fk? = -JÊ , (B.28) 
ðaFgy + OgF ya + Oy Fag = 9, (B.29) 


where F°? is the electromagnetic field strength tensor (with F°! = E), 
F? = By, etc.), J” is the electric current four-vector (with J? the charge 
density and J!, J”, J? the electric current density), and 


Fag = Nayngs F”? . (B.30) 
Hence in the presence of gravitation the field equations are 
F., = —J” (B.31) 


Favia T FPyasu T Fyuw =0 > (B.32) 
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with 
À 
Fav = Sul . (B.33) 


We use the same letter of the alphabet for tensors like F,,, and F”” that 
are related by raising and lowering indices by contraction with the metric, 
because they represent the same physical quantity. 


8 Currents and conservation laws 


For a system of particles labeled by an index n, with spacetime coordinates 
xh (u) and electric charges e,, the electric current four-vector is given by 


e l 4 dx" (u) 
J” (x) — eai He (xn(w) - x) Aa 

=.= fi ; dx"(t) 

= TDR 3 en ô (xn(0 — x) an 


(B.34) 


Here 6*(z) is a fictitious function with an infinitely narrow and infinitely 
high peak at z = 0, normalized so that, for any smooth function f(z), 


firo to- xfa), 


and 53(z) is the same in three dimensions. In particular, the integral of 
J/—Detg J? over a finite three-dimensional volume equals the total electric 
charge within that volume. (In cosmology we would be more interested in 
the baryon current, with the baryon number of the nth particle or the nth 
galaxy appearing instead of e,.) The factor 1/,/—Detg(x) is needed here, 
because the four-dimensional delta function is not a scalar. We can see this 
by noting that, under a transformation from coordinates x“ to coordinates 
x’, the differential spacetime volume element is changed to 


ax! 


d4 a 
i Ox 


d*x (B.35) 


where |dx’/0x| is the Jacobian of the coordinate transformation — that 
is, the determinant of the matrix whose components are 0x’ /dx". The 
Jacobian can be expressed in terms of the determinants of the metrics; by 
taking the determinant of Eq. (B.3), we find 


=2 


ə 
Det g' = = Detg. (B.36) 
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Hence it is the spacetime volume d*x./—Detg rather than d*+x that trans- 
forms as a scalar. (A minus sign is inserted in front of the determinant of the 
metric, because in physical spacetimes this determinant is negative.) From 
the defining equation of the delta function 


fy) = J P 54(x— y) 
= J (dix Dg) £09 (8 — »y//—Detg@) . 


we see that it is the ratio 54(x, — x)/,/—Det g(x) appearing in the current 
(B.34) rather than the delta function itself that transforms as a scalar. This 
current satisfies the conservation law 


i (/—Detg(x) J"(x)) £ J du 2 enb nlu- =0, (B37) 


provided x is not at the value of any xn (u) at either endpoint of the integral. 
This is the same as the generally covariant conservation law 


ð 
0 = Teo = a + Egg” (B.38) 


because : i 

Tiv = 58” ugua = zu In ( — Det) ’ 
This is the correct conservation condition, because in the absence of 
gravitation there is a current that in Cartesian coordinate systems satis- 
fies the conservation law 0,,J“ = 0, and therefore in general coordinates 
in a gravitational field must satisfy the generally covariant generalization 
(B.38). 


9 The energy-momentum tensor 


Likewise, in the absence of gravitation any set of particles and/or fields will 

have a symmetric energy-momentum tensor 7%, which is conserved in the 
sense that 

aT 

axP 

Just as JÊ is the 6 component of the current of electric charge, we can 

think of T°? as the 8 component of the current of p®. In the presence of a 
gravitational field the conservation law becomes 


aT’ 
ox” 


=0. (B.39) 


T”, = rer 2 Pore =. (B.40) 
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The final F term here is a geometric effect, similar to what we found for 
conserved currents, but the other I term represents the exchange of energy 
and momentum between gravitation and the other fields. 

For instance, for an ideal gas of particles that move freely except for 
gravitational forces and perhaps for collisions that are localized in space, 
the energy-momentum tensor is given by replacing e, in Eq. (B.34) with the 
energy-momentum four-vector ph = Ey dx! /dt, whose spatial components 
are the components of the three-momentum, and p? = Ey: 


Sete. l 4 dxn (u) , 
T” (x)= J-Deg 0) J HD? (xn(u) = x) dü P (u) 
1 
= EE F(x = x) ph pj ()/En(O . (B.41) 


In particular, T° is the energy density. Direct calculation using the equation 
of motion (B.12) shows that this satisfies the covariant conservation law 
(B.40). 


10 Perfect and imperfect fluids 


A perfect fluid is defined as a medium for which at every point there is 
a locally inertial Cartesian frame of reference, moving with the fluid, in 
which the fluid appears the same in all directions. In such a locally inertial 
co-moving frame the components of the energy momentum tensor must 
take the form 


Ti =p, T° =T%=0, T” =p, 


where i and j run over the three Cartesian coordinate directions 1, 2, 3. (This 
is because non-zero value of T’? and any term in TË other than one propor- 
tional to ô;; would select out special directions in space, such as the direction 
of T”, or of one of the non-degenerate eigenvectors of TË.) The coefficients 
p and p are known as the pressure and energy density, respectively. Then 
in a locally inertial Cartesian frame with an arbitrary velocity, the energy- 
momentum tensor takes the form 


T = pn + (pt puu? , (B.42) 


where p and p are defined to be the same as in the co-moving inertial 
frame, and u” is defined by the conditions that it transforms as a four- 
vector under Lorentz transformations, and that in the locally Cartesian 
co-moving inertial frame it has components u? = 1 and ut = 0. This four- 
vector, known as the velocity vector, is normalized so that, in any inertial 
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frame, NopueuP = —1. It follows that in a general gravitational field the 
energy-momentum tensor of a perfect fluid is 


TY = pg? + (p+ p)u"u”, gyyutu = —1 (B.43) 


where p and p are defined by the condition that they are equal to the 
coefficients in the energy-momentum tensor in a locally co-moving iner- 
tial coordinate system, so that they are scalars, and u” is defined by the 
conditions that it transforms as a four-vector under general coordinate 
transformations and has the components u? = 1 and u! = 0 in the locally 
co-moving Cartesian inertial frame. This formula for T“” is correct because 
it is generally covariant and it is true in locally inertial Cartesian coordinate 
systems. The equations of relativistic hydrodynamics in a gravitational field 
are derived by imposing the conservation condition (B.40) on this tensor. 
In addition, if the pressure depends on the density n of some conserved 
quantity such as baryon number as well as on the energy density p, then 
we need the equation of conservation, which in locally inertial Cartesian 
frames reads 


1 (n u”) =); (B.44) 


Thus in a general coordinate system in an arbitrary gravitational field, we 
have 


(n u") Sie (B.45) 
su 

For an imperfect fluid, there is a small correction AT” to formula (B.42) 
for the energy-momentum tensor in locally inertial Cartesian coordinate 
system: 


TÊ = pn + (p + puču? + AT (B.46) 


and a small correction AN“ to whatever current may be conserved 


(n u” + AN”) = (B.47) 
Ox 

The scalar p is defined as the energy density observed in a co-moving frame 
in which uv! = 0, so that in this frame AT°? = 0. This implies that in 
all locally inertial Cartesian frames uug AT «ß — Q, since this quantity is a 
scalar and vanishes in a co-moving frame. The scalar n can be defined as the 
value of the conserved density observed in such a co-moving frame, so by the 
same reasoning in all locally inertial Cartesian frames we have ug AN” = 0. 
The pressure can be defined as whatever function of p and perhaps n gives 
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the pressure in a static homogeneous fluid. (A different definition is used in 
Chapter 5 et seq.) But the definition of the velocity four-vector u” remains 
somewhat ambiguous. We could define wu’ to be the velocity of particle 
transport,! in which case in a co-moving frame with u’ = 0 we would have 
AN‘ = 0 as well as AN? = 0, so that in general locally inertial Cartesian 
frames AN® = 0. Instead we will define uv! to be the velocity of energy 
transport, so that in co-moving frames we also have T? = AT'® = 0, and 
so ugAT*% = AT®™ = 0 in this frame, which implies that ug AT”? = 0 
in all locally inertial Cartesian frames, but in general AN® 4 0. With this 
definition of velocity, the second law of thermodynamics together with the 
conditions ug AT’ = 0 and u,AN® = 0 gives? 


ou ðu ðu ðu 
ATag = —1 (5 = + ugu” r + Uau” wt) 


axB axe ax’ ax! 
2 ðu” 
=(0 — 5 (nap + Uap ) ; (B.48) 


ANa 


AE [2 e: a 


Here n, ¢, and x are the positive coefficients of shear viscosity, bulk viscosity, 
and heat conduction, respectively, and jz is the chemical potential associated 
with the conserved quantum number, defined by the condition that the 
entropy density is (p + p — wn)/T. It is then an immediate consequence 
of the Equivalence Principle that in general coordinate systems in arbitrary 
gravitational fields that vary little over a mean free path or mean free time, 


k k 
AT w = -=n (upv F Uy;u T Uyu Uu; + Upu Uy«) 


2 
—(¢ — zU" « (su T upu) > (B.50) 


CA a eA 


AN, 


II 
l 
x 


l This is the option adopted by C. Eckart, Phys. Rev. 58,919 (1940), and also in Secs. 2.11 and 15.10 
of G&C. 

2This is the definition used by L. D. Landau and E. M. Lifschitz, Fluid Mechanics, trans. by 
J. B. Sykes and W. H. Reid (Pergamon Press, London, 1959), Section 127. We will adopt this definition 
of velocity in this book, because it imposes the maximum possible constraint on the energy-momentum 
tensor at the cost of putting less of a constraint on the current of conserved quantities, and in cosmology 
we frequently have to do with situations in which there are either no non-zero conserved quantities at 
all, such as in the early universe before cosmological leptogenesis or baryongenesis, or no conserved 
quantities that are large enough to seriously affect the relation between pressure and density, as in the 
radiation-dominated era at temperatures above about 104K. 

3 Landau & Lifschitz, op. cit. 
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11 The action principle 


There is a general algorithm for deriving the energy-momentum tensor for 
systems that may be more complicated than an ideal gas or perfect fluid, 
provided only that they are governed by an action principle. According 
to the action principle, the differential equations governing the behavior 
of particles and fields can be expressed as the statement that the “matter 
action,” a functional /,,, of the fields and particle trajectories, is station- 
ary with respect to infinitesimal variations of the fields and particles. The 
Equivalence Principle tells us to include the metric in the matter action in 
such a way that Im is invariant under general coordinate transformations. 
Then the change in the action when we make an infinitesimal change ôg,» 
in the metric (leaving all other dynamical variables unchanged) must be of 
the form 


Slm = 5 | tv Dets T Œ) bgu (xX), (B.52) 


where T#”(x) is a symmetric tensor, which we identify as the energy- 
momentum tensor. 

For instance, the action for a gas of charged particles with masses mp, 
charges en, and trajectories x% (u) interacting with electromagnetic fields is 
taken as 


1 
In = = J d*x,/—Det g F ıvFpo g” g” 


dxh (u) dx? (u) Me 
= Zorn f au -ewent du dn | 


dxh (u) 
+E fau Ju A,(u) 


in which the homogeneous Maxwell equations (B.29) are enforced by writing 
the field strength tensor in terms of a vector potential A,, as Fav = 0, Ay — 
0yA,,. The reader can verify that the conditions for the matter action to 
be stationary with respect to arbitrary small variations in A, (x) and x} (u) 
(arbitrary, except that they vanish for x“ — oo or u — +00) are the 
inhomogeneous Maxwell equations (B.28) together with the equations of 
motion of charged particles in a combined gravitational and electromagnetic 
fields 


Xn pu , 
dt; "P dTn dn eo 


d? H dx’ d p dx” 
Mhn | Py oe = en i 
dtp 
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where t, is the invariant proper time along the nth particle trajectory, 
defined by 


du du du 
Eq. (B.52) gives an energy-momentum tensor 


H v 1/2 
din _ -emen (u) dx; 2) . 


dxh dx} 4 
e EE 
dt dtn E 


T” (x) = [-Detg(x)] 1 Som J dT 


PPE (AEs Y= ie OOF P9 (x)Fpo (xX) , 


which is the same as Eq. (B.41), with extra terms representing the energy 
and momentum in the electromagnetic field. (In the derivation we use the 
relations ôDetg = Detg gH” guy and dg"? = =g” gog’ .) 

The justification of the identification of T#” in Eq. (B.52) is that this 
tensor is conserved, in the sense of Eq. (B.40). To show this, note that in 
general our assumption that J, is a scalar tells us that it is unchanged if we 
simultaneously make the replacements 


dx > d*x’, oE 2 


— B.53 
axl ax!’ ( ) 
m ıl Ia dx” 
xh U) —> xp U), Au) > 4,0) = 5 Av) ’ (B.54) 
ax? ax® 
Zw x) > AMED) = Jak gy EPI x), (B.55) 


and likewise for any other fields entering in the action. But the coordinate 
x” (unlike x'% (u)) is just a variable of integration, so we can change x’ 
back to x“ everywhere without changing Im. It follows that J is unchanged 
by the replacements 


xH(u) > x (w) , (B.56) 


0 v 
Ay(x) > A, = ar v(x) — [A (x) -AO (B.57) 


A ax? ax? ia oy j 
Suv(X) > Sy) = Baxi giv Ee (x) — [guy@) — 4,00], (B.58) 


with x” and 3/3x” now left unchanged. (This combination of a coordinate 
transformation and a relabeling of coordinates is sometimes called a gauge 
transformation.) For a general infinitesimal coordinate transformation we 
have x" —> x + e#(x), with e“(x) arbitrary infinitesimal functions of x. 
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Then the transformations (B.56)-(B.58) become 


xh(u) > xh (u) + (xn) , (B.59) 
Aux) > Ay) ~ “<4, — “HO ory, (B60) 
Suv (x) > Suv(x) — a) Spv(x) — ae ae (x) 

-w <0) l (B.61) 


Now, as long as the equations of motion of particles and the field equations 
for “matter” (including electromagnetic) fields are satisfied, the action is 
unaffected by any infinitesimal changes in particle trajectories and matter 
fields. On the other hand, unlike the total action of matter plus gravitation, 
the matter action is not stationary under variations in the metric, even 
when the field equations are satisfied. For a general infinitesimal coordinate 
transformation, using Eq. (B.61) in Eq. (B.52) lets us write the condition 
that Im is a scalar as 


0 = ôm = J d*x \/—Detg(x) T œ) |- U 5 56) = ge) 


axl ox” 


= O8yv(x) 
oxP 


eow] , (B.62) 


Integrating by parts and setting the coefficient of €? (x) equal to zero then 
yields the conservation condition (B.40). 


12 Scalar field theory 


We will frequently encounter cosmological models involving a scalar field 
v(x), with an action* 


l 
I=- J d*x ,/—Detg Fae Ta vo] , (B.63) 


Ox ax? 


where V (ọ) is a function known as the potential. The field equations for 
ọ in a gravitational field are given by the condition that this be stationary 
with respect to variations in g: 


1 ð 3p IV (p) 
———._ — | y —Detg g” — | = . B.64 
„~ —Detg 0x” | mee x| dg ( ) 


4Tn general, the coefficient of the first term in square brackets might depend on ø, but such a field- 
dependent coefficient can always be eliminated by a redefinition of the scalar field. This simplification 
is not generally possible, however, with more than one scalar field, the case discussed in Section 10.4. 
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The energy-momentum tensor for the scalar field is found by varying the 
metric and comparing with Eq. (B.40): 


dp Op 


l ic 99 3P 
gh? — —. 
OxP Ox? 


TH” — =g” 5 


fv B.65 
2 Ox? ax? ( ) 


+ vo] +g g” 


This has the same form as the energy-momentum tensor (B.43) for a perfect 
fluid, with energy density, pressure, and velocity four-vector given by 


1 ap Oy 
SoG MV oe 
REE e Sager t Vg) (B.66) 
1, 9 dy 
— __ gk es 
p=- VO (B.67) 
ap ag] P 3E 
Mi _ | _gfd ae HT 
SNS | Saxe “| xt eo 


(The sign of u” does not affect the energy-momentum tensor, so it cannot 
be found by comparing Eqs. (B.65) and (B.43). It is chosen here so that 
u? should have the value u? = +1 in the case considered in Chapter 4 — 
a spacetime with g°? = —1 and a scalar field that does not depend on 
position and increases with time — provided the square root is understood 
to be positive.) The reader can check that T$” is conserved in the sense of 
Eq. (B.40) as a consequence of the field equation (B.64) for g. 


13 Parallel transport 


A body carried along an orbit x“ = x(t) may be characterized by one 
or more ¢-dependent vectors or tensors. If these vectors or tensors do not 
change at time ¢ in a frame of reference that is locally inertial at x” (t), then 


5This result for pressure is different from that given (without explanation) by E. W. Kolb and 
M. S. Turner, The Early Universe (Addison—Wesley, Redwood City, CA, 1990), Eq. (8.21), accord- 
ing to whom the pressure of a scalar field in a Robertson—Walker metric is 


p= /2— Vy) — (V9) /6a" , 
instead of the result for a Robertson—Walker metric given by Eq. (B.67) 
p= 8/2- Vig) — (V9) /2a° . 
The Kolb-Turner result is obtained if we define the pressure as the value of T';/3 measured by an 


observer in a locally inertial coordinate system moving with four-velocity vu 


p= (gup = Upvp) TP? (65 + vov") > 


1 
3 
but take the velocity v“ to have components v? = 1, vÍ = 0. It seems more natural in using this formula 


for pressure to take v” as the velocity u” given by Eq. (B.68), for this is the velocity that appears in the 
perfect fluid formula (B.43) for the energy-momentum tensor. This choice of v¥ leads to Eq. (B.67). 
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in an arbitrary frame of reference they will undergo a change known as 
parallel transport. For instance, a contravariant vector v” (t) will have a rate 
of change 


dv" (t) 
dt 


To check that this is valid, we note that it is assumed to be true at time t 
in frames of reference in which the affine connection wae x(1)) vanishes, 


7 ey 


=r% (x o)” (1) (B.69) 


and it is generally covariant, and it is therefore true in all frames if it is 
true in such locally inertial frames. In particular, Eq. (B.12) shows that 
the momentum p” = dx” /du satisfies the equation of parallel transport. 
Similarly, any tensor carried along an orbit that does not change in a frame 
that is locally inertial at a point along the orbit will have a rate of change in 
general frames at that point given by a sum of terms like the right-hand side 
of Eq. (B.69), with a —T for every contravariant index and a +T for every 
covariant index. For instance, for a covariant tensor J,), the equation of 
parallel transport is 


dJ v(t) 
dt 


dx my ss = 


=T COSO rf, (xo) J up) 


14 The gravitational field equations 


It remains only to give the equations that govern the gravitational field itself. 
These must satisfy two requirements: they must be generally covariant, and 
for weak slowly changing gravitational fields they must yield the Poisson 
equation for the Newtonian potential ¢ in Eq. (B.21) 


V? =4nGT” , (B.70) 


where G is Newton’s constant. If we limit ourselves to partial differential 
equations that (like the Poisson equation) have just two spacetime derivatives 
of the metric, then the field equations are unique: 


1 
Rw- 588 Rax = -8r GT wv , (B.71) 


where T» is the energy-momentum tensor with lowered indices 


Twv = Sua vn T™ (B.72) 
and Rw is the Ricci tensor: 
ar, ar’ 
Ruy = aes T eee — TET he (B.73) 
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For weak static fields the time—time component of this equation is the same 
as the Poisson equation (B.70), provided goo is related to @ by Eq. (B.21). 
The tensor appearing on the left-hand side of Eq. (B.71) satisfies a set of 
differential Bianchi identities: 


K 
je” (Ru — 588" Rn) | =O. (B.74) 


This is why this is the linear combination of the tensors Ry» and guvg™ Rix 
that appears on the left-hand side of the field equations; otherwise the field 
equations would not be consistent with the energy-momentum conservation 
law (B.40). The field equations could have been derived more easily by 
including a gravitational term in the action 


1 
Das 
2 167G 


J d*x,/—Detg(x) g% (x) Ry . (B.75) 


The Bianchi identities (B.74) can be derived from the fact that J, is auto- 
matically invariant under general coordinate transformations of the metric, 
and the field equations (B.71) can be derived from the condition that the 
total action I, + Im be stationary with respect to arbitrary variations of the 
metric. 

If we allow terms in the gravitational field equation with fewer than 
two spacetime derivatives, then it is possible to include a term on the left- 
hand side of the field equation (B.71) proportional to g,,,. This is the 
so-called cosmological constant term, discussed in Section 1.5. It can be 
regarded as a “vacuum-energy” correction to Ty. Aside from a cosmolog- 
ical constant, the only other possible modification of the left-hand side of 
Eq. (B.71) would involve terms with more than two spacetime derivatives. 
(They can be derived, for instance, by including terms in the integrand of the 
gravitational action proportional to /—Detg R” R, or /—Detg (R",,)*.) 
Dimensional analysis tells us that such terms would have coefficients whose 
dimensionality (relative to that of the factor 1/167 G in Eq. (B.75)) is a posi- 
tive power of length. The experimental success of General Relativity shows 
that this length is much smaller than the scale of the solar system, so the 
effect of such terms would be completely negligible at cosmological distance 
scales. 
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Appendix C 


Energy Transfer between Radiation 
and Electrons 


One often needs to know the rate at which a photon will lose or gain energy 
when passing through an ionized gas. For instance, we need this information 
in calculating the Sunyaev—Zel’dovich effect, discussed in Section 2.5, and in 
understanding the preservation of thermal equilibrium between matter and 
radiation, treated in Section 2.2. This appendix will first derive formulas 
for the mean and the mean square change in energy of a photon in striking 
a single electron, and will then use these results to derive the rate of change 
of the photon energy distribution function in passing through an ionized 
gas. For the purposes of this section, we take i= c= 1. 

Suppose that an electron traveling in the three-direction with momentum 


four-vector 
p= (0, 0, pe, Ec) > Ee = 4 p2 + m? ny (C.1) 


is struck by a photon with energy œ moving along a direction with polar 
and azimuthal angles n and ¢, giving the photon a new energy w’ and a 
new direction with polar and azimuthal angles n’ and ¢’. That is, the initial 
four-momentum q and final four-momentum q’ of the photon take the forms 


q= (sin n cos @, sin 7 sin $, cos n, I)o, (C.2) 
q = (sin n' cos ¢’, sinn’ sind’, cos n’, I)o : (C.3) 


To calculate the cross section for this scattering event, we perform a Lorentz 
transformation to a frame of reference in which the electron is initially at 
rest, in which case the cross section takes a simple and well-known form. In 
the electron rest-frame, the initial and final photon four-momenta are Lq 
and Lq’, where L“, is the Lorentz transformation 


1 0 0 0 
0 1 0 0 

L= ; C.4 
00 y py a 
0 0 -y y 


where 8 = pe/Ee is the electron velocity, and y = (1 — 6*)~!/?. This gives 
the initial and final photon four-momenta in the electron rest frame as 


Lq = | sinacos®@, sina sing, cosa, 1)k : (C.5) 
Ld = ( sina’ cos ¢’, sina’ sing’, cosa’, 1)k’ : (C.6) 
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where the initial and final photon energies in the electron rest frame are 
k=(—Bycosn+y)o, k’ = (~By cosy + y)a! (C.7) 


while the initial and final polar angles of the photon’s velocity in this frame 
are given by 
cos7n — B , cos —B 


cosa = ——— COS QŒ 


ce ue, C8 
1 — Bcosn’ 1 — B cos y’ oe 


and there is no change in the azimuthal angles. We also recall that the 
conservation of energy and momentum in the electron rest frame gives the 
final photon energy in this frame as! 


k 
Pehee; C.9 
1 + (k/me)(l — cos 0) a) 
where 6 is the scattering angle in the electron rest frame 
cos@ = Lq- Lg = cosacosa’ + cos(¢ — ¢’) sina sina’ . (C.10) 


The fractional change in the photon energy in the original frame of reference 
can then be expressed in terms of quantities in the electron rest frame by 


o — w 1 1 + 8 coso’ ) 
= — —1}). (Cll 
w 1 + £ cosg ( ppsa Ci 
In most cases of interest k/me and f are much less than unity, so we will 
keep only terms of first order in k/me and zeroth order in f together 
with terms of (for the moment) arbitrary order in 6 and zeroth order 
in k/me: 


SAA Gascon L A (C.12) 
1+ Bcosa 


For an electron at rest and for photon energy k < me, the cross-section 
differential (Summed over final spins and polarizations, and averaged over 
initial spins and polarizations) is 

_ 307 


E 2 / 1 1 
do ex (1 + cos 6) d(cosa’) do ; (C.13) 


1QTF, Vol. I, Eq. (8.7.14). 
2QTF, Vol. I, Eq. (8.7.42). 
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where øz is the Thomson cross section e*/62m?2. The cross-section differ- 
ential is itself Lorentz invariant, though the variables on which it depends 
are not, so the average photon energy change per collision is 


1 
— | (œ —a)do 
OT 


1 27 
= =/ dosa’) f dd’ (w' — w) (1 + cos? 0) 
16x an| 0 


ae Bwcosa hie C14 
1+ Bcosa me 


(W — w) 


We must average this over the electron direction of motion, or equiva- 
lently over œ. In calculating this average we must keep in mind that it is 
cos ņ rather than cosa whose distribution function in the interval from —1 
to +1 is flat. Also, the transition probability is proportional not only to do 
but also to the relative speed? 

Ip -ql 1:=p? 
u= 


=1]1— = —_—_.. CAS 
Eww pegsi 1 + £ cosa ( ) 


Hence the average of the energy change over the relative direction of the 
initial electron and photon is 


+1 +1 
((o! oy) = f (o! -ayudeosn / | udcosn 
—1 —1 


-1 f” (1 — 8)? |- Bacosa kw 


= — d ; 
2J-1 (1+ fcosa)3 1+ Bcosa e] eR 


(C.16) 


The terms of first order in 6 make no contribution to the integral over œ, so 
the leading terms in the fractional energy transfer are those of second order 
in £ or first order in w/me: 

k 4 


4 
la — w))/@ = -a DOEN SE (C.17) 
Me 3 Me 


Since the mean photon fractional energy change contains terms of order 
B* and w/me, we will be interested in any terms in the average squared 
fractional energy change of the same order. Inspection of Eq. (C.12) shows 
that to this order, we have 


(a — w)? = B?(cosa’ — cos œ)? (C.18) 


3QTF, Vol. I, Eq. (3.4.17). 
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To this order we can neglect the difference between u and unity and between 
d cos ņ and dcosa, so that 


3B2m2 f+! +1 
(((@! — )*)) ~ ee dcosa d cosa’ 
327 zi —1 

2x 

x d¢'(cos a’ — cos æ)? (1 + cos? 6) 
0 

2 

= iho. (C.19) 


All this is for an electron of a fixed speed £. If the number of electrons 
with speed between 6 and 6 + df is given by the Maxwell-Boltzmann 
distribution with electron temperature Tę, and is hence proportional to 
B? exp(—meB?/2kgT) dB, the average value of 8° is 3kgTe/me, and 
Eqs. (C.17) and (C.19) become 


2 
LE eae (C.20) 
Me Me 
and 
Wo — )?)) = eTe? (C.21) 


Now suppose that photons with a number N (@)dw with energies between 
œw and w + dq interact with a gas of non-relativistic electrons. For photon 
energies w < me, the rate of change of the distribution function is 


n(w)dw = ne J da’ n(a’) R(w’ > w)dæ (1 + N(o)) 
—Ne n(w) do | dw’ R(wa > w) (1 +N).  (C.22) 


Here ne is the number density of electrons; R(@ —> w’)da’ is the aver- 
age of udo over initial electron velocities for collisions in which a photon 
of initial energy œ is given an energy between œ and w’ + da’; N(w) = 
(22h)3n(w)dw/8xw"dw is the number of photons per quantum state of 
energy w (the denominator is 82wdw instead of 4rw?dw because of the 
two polarization states of photons); and the factors 1 + N (w) and 1+ N(a’) 
are included to take account of the stimulated emission of photons into 
states that are already occupied. The first term in Eq. (C.22) gives the 
increase in n(w) due to scattering of photons of any initial energy œ into 
energy œw, while the second term gives the decrease in (wm) due to scattering 
of photons of energy w into any final energy w’. We saw that collisions 
of photons of energy œ < me with a non-relativistic gas of electrons at 
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temperature T typically change photon energies by only small fractional 
amounts, of order w/me or kgT /me, so the rate constants R(w’ —> œw) and 
R(@ —> a’) are sharply peaked around w’ ~ a. It is therefore convenient 
to change the variable of integration from w’ to A = œ — a’ in the first 
term and to A = a’ — w in the second term of Eq. (C.22). Also canceling 
the unintegrated differential dw and multiplying with (27h)? /82w~, this 
formula now reads 


[0,0] 
N@) = al dA (w— A)? Nœ- A) Rœ — A > o) (1+ N()) 
WO” J—oo 
—neN (w) J dA Ria>o+A)1+N(@+A)). (C.23) 
Since the rate coefficients R(w — A > w) and R(w > w+ A) are sharply 
peaked around A = 0, we can expand the other factors in Eq. (C.23) (and 
the difference between R(w— A —> œ) and R(w —> w+ A)) to second order 
in A: 


No “ah dA A (1 +N) [o Nao keswa A)| 


(1 J vw) [o N@) R@ > o+ A)| 


B 


ƏN (Œw) 
Jw 


-ne No) f dA A R(w > w+ A) 


NO 


vi) f dA A? a E ENA 


This can be rewritten as 


E a E dA A Ž [o? N@) R@ > w+) (1 af N@))| 


2 
+55 kan [> No) Ro > w+ A) (1+ No))| 
aoe UA a? Z [e No R(o> o+ A) e] 
w Joo dw dw 


Inverting the order of integration and differentiation, this becomes 


. ; a ; 
Nw) =F E (o? N(w) (1+ N(@))((o! —@))) 
1 ə? 
Eg (° No) +N) = @)"))) 
+o N(w) ((a! — @)*)) 2) (C.24) 
Jw 
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in which we use 
f dA A R(w > w + A) = or (læ — w)), 
<f dA A? Ræ > o+ A) =or(((o' —@))). (C25) 


Inserting the values (C.20) and (C.21) of ((w’ — w)) and (((w! — w)*)) gives 
the Kompaneets equation:4 


neorkgTe ð | oa neor ð 
——— jo 


N (w) = Sa 
o) Mew Jw 


[o V@) (1 +Nw))| l 
(C.26) 


meo? Iw dw 


As a check, we may note that if the photon distribution function is 
given by the Planck formula with photon temperature equal to the electron 
temperature, so that N(w) = [exp(@/kgT) — 1]~!, then the two terms in 
Eq. (C.26) cancel, giving no change in N (œ), as of course must be the case for 
photons in equilibrium with electrons. As a further check, we note that (as 
long as œt N (w) and œt 3N /dw both vanish at w = 0 and œw —> oo) the total 
number density of photons {5° 42 w* N(w) dw does not change with time, 
as could also be seen by integrating Eq. (C.22) over w, and interchanging w 
and w’ in the second term. 


4A, Kompaneets, Zh. Exper. Teor. Fiziki 312+, 876 (1956). 
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In cosmology we often have to deal with position-dependent variables like 
temperature or density or scalar fields whose fluctuations are governed 
by some sort of probability distribution. In this appendix we will con- 
sider a general real random variable g(x) depending on a D-dimensional 
Euclidean coordinate x. The generalization to several random variables will 
be obvious. 

We will assume that the distribution function giving the probabilities 
of various functional forms for g(x) is homogeneous, in the sense that the 
average of any product of ys with different arguments depends only on the 
differences of the arguments. That is, for arbitrary z, 


(9(x1)9(X2) ++ (Xn) = (Pax + zox +2): 9an +z). (D1) 


In cases where x is a time coordinate, such distributions are said to be 
stationary. 

We will further assume that the gs at distant arguments are uncorrelated. 
To put this formally, we assume that for |u| — oo, 


(px + u)p(x2 +u): -gQ — ujo- u): -) 
> (p(x +ujy (x2 +u) +++) (Pı — upy — u)---) 
= (g(x p(x) ) (P002), (D.2) 


with the final expression justified by Eq. (D.1). 

Under these conditions, we can prove an important result known as the 
Ergodic Theorem: If the limit in Eq. (D.2) is approached sufficiently rapidly, 
then the root mean square difference between any product g(x; + z)o (x2 + 
z)-+-, averaged over a range R of z values around an arbitrary point Zo, 
and the ensemble average of the same product, vanishes as R~?/? for large 
R. That is, if we define 


AR(X1, X25. +) = (( f Pemo p(x, +z)o(x2 +z): 


2 
—(ø(xıi)o (x2): ) ; (D.3) 
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where! 
NRE) = (VTR? exp ( — Iz — zol?/R?) , (D4) 
then for R > œ, 
Ar > O(R- PP). (D.5) 


To prove this theorem, we first use the condition f Nr(z) d?z = 1 to 
rewrite Eq. (D.3) as 


AR (x1, X25 oS ( [amo Ee + z)p(x2 +z) 


2 
— (pr) p%): I) : 


Expanding the square and again using Eq. (D.1) and the normalization 
condition for Nr, and introducing new integration variables u = (z — w)/2 
and v = (z + w)/2, we have 


A3 = | Peno [ Pw Nro 


x (ves + Z)p(X2 +z): -oxı +w)e(x2 + w)»: ‘| 


2 
-vanea ) l 
2 D 
= (=) [are exp ( — 2|v — zol?/R?) [aru exp ( — 2lu?/R?) 
x (ves +wx +u): -gx — ujo (x — u): i 
2 
-vanea =) l 
2 \ 2/2 
= (=) [aru exp ( — 2|u?/R?) 


'The specific form (D.4) for the function Np is not essential. It is only important that the function 
be constant for |z — zg? < RÈ, vanish rapidly for |z — zl? >> R2, and be normalized so that 
J NRE) d?z = 1. 
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x (ves + ujp(x2 + u): -oxı — ugp(x2 — u) - - j 


2 
-vaoo . ‘| l : 


Assuming that the limit in Eq. (D.2) is approached sufficiently rapidly, the 
u integral would converge even without the factor exp ( — 2|u|? /R’), so for 
R — œ we can take this factor as unity, and find our final expression 


2 D/2 
T 


x (ves + u)p(x2 + u)-+- p(x1 — U)e(xX2 — U)-- i 


2 
-vano ii i | ; (D.6) 


thus confirming Eq. (D.5). 
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Appendix E 


Gaussian Distributions 


Consider a random variable g(x) depending on a D-dimensional coor- 
dinate x. We will define it to have zero average value, by subtracting from 
ọ any non-zero average it may have. The distribution function governing @ 
is said to be Gaussian if the average of a product of an even number of gs 
is the sum over all ways of pairing ys with each other of a product of the 
average values of the pairs: 


eaea) =)= Yo T] ee). (E.1) 
pairings pairs 
(with the sum over pairings not distinguishing those which interchange 
coordinates in a pair, or which merely interchange pairs), while the aver- 
age of the product of any odd number of gs vanishes. For instance 


(P(X1) (X2)9(%3) P(X4)) = (P(X1) O(X2)) (p (x3) (x4)) 
+ (p(x1) 9 (%3)) (P(X2) (x4)) + (px) (x4) (p (x2) (x3)). (E.2) 


(Of course, there is no way of pairing all of an odd number of gs, which is 
why for Gaussian distributions the average of any odd number of ys must 
vanish.) More generally, in the average of a product of 2n factors of g, each 
of (2n)! permutations of the coordinates defines a pairing (with the first 
permuted o paired with the second, the third with the fourth, and so on), 
but 2”n! of these differ only by permutations of the two coordinates within 
a pair or permutations of the n pairs, so in general this average contains 
(2n)!/2"n! terms. 

If g(x) is governed by a Gaussian distribution, then so is any linear 
functional of g(x). For instance, consider a set of linear functionals of the 
form 


Fuel = | dx fie) 0) (E.3) 


By multiplying Eq. (E.2) by fi (x1) fo(x2) /3(%3) fa(x4) and integrating, we 
find 


(Pi Fo F3F4) = (FF 2) (P3F4) + (Fi F3) (FoFa) + (Pi F4)(FoP3).  (E.4) 
Such distributions are called Gaussian because of the form taken by the 


probability distribution of general linear functionals F[g] = f dxf (x)p(x) 
ofo. In (F2"[g]), each of the (27)!/2"n! terms makes the same contribution 
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(F?[g])”, so 

(F*"[]) = (F?[g])"(2n)!/2"n! . (E.5) 
This implies that the probability P(F) dF that the functional is between F 
and F + dF is the Gaussian function 


hase sires dF (E.6) 
~ / On (F2) P\ UFA 


Distributions of this sort arise commonly when ¢(x) is the sum of a large 
number of independently fluctuating terms. The central limit theorem tells 
us that in this case the distribution of the sum is Gaussian. In the application 
that most concerns us here, the fluctuations in the temperature of the cosmic 
microwave background are believed to be nearly Gaussian because they arise 
(as discussed in Chapter 10) from the quantum fluctuations of one or more 
nearly free quantum fields. 
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Newtonian Cosmology 


During the era when the energy density of the universe is dominated by 
cold dark matter, the behavior of perturbations can be adequately treated 
by the methods of Newtonian mechanics. This has applications both for 
our introductory study of anisotropies in the cosmic microwave background 
in Section 2.6, and for the treatment of the large scale structure of matter 
in Section 8.1. 

The equations of non-relativistic hydrodynamics and Newtonian 
gravitational theory for a fluid with mass density p, velocity v, zero pressure, 
and gravitational potential ¢ are the equation of continuity 


ap 
T+V: (ve) =9, (F1) 
the Euler equation 
ƏV 
at (v v)y —_V¢, (F.2) 
and the Poisson equation 
V? =4nGp. (F.3) 


These equations have an unperturbed solution (distinguished by bars) of 
the form 


p = po(ao/ay, V=HX, 6 =2nGpXx’/3, (F4) 
where H = å/a; ao and po are constants; and a(t) satisfies the equation 
& + K = 8r Ga /3, (E5) 


with K a constant. (We use X to denote the ordinary Euclidean coor- 
dinate vector, to distinguish it from the co-moving coordinate vector x.) 
This of course corresponds to a cosmological theory with a Robertson- 
Walker metric, and indeed we have already encountered this solution in 
Section 1.5, as an alternative approach to the derivation of the Friedmann 
equation (F.5). In particular, the solution of the equation dX/dt = ¥ = 
HX for the motion of a co-moving object is X(t) = [a(t)/a(to)]X (to), 
in agreement with Eq. (1.5.22). The co-moving coordinate is thus 
x = X/a. 

To this unperturbed solution for pọ, v, and @ we now add small 
perturbations êp, dv, and ô. The terms in Eqs. (F.1)-(F3) of first 
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order in these perturbations are 


36 
5. +3HSp + HX-Vsp + pV -bv=0, (F6) 

06v 
C + HX: Vôv + Hv = —V5¢, (E7) 
V?8ġ = 4r Gôp . (F.8) 


Eqs. (F.6)-(F.8) do not appear translation-invariant, but the underly- 
ing spatial homogeneity of this problem can be restored by writing the 
equations in terms of the co-moving coordinate X/a. This is brought out 
most conveniently if we write the perturbations as Fourier transforms in the 
co-moving coordinate: 


p(X, t) = J dq exp (“AS a EOE (F9) 


and likewise for ôv and 5¢. The partial differential equations (F.6)-(F.8) 
then become the ordinary differential equations 


dôpq 


a i Pq: ôg =Q, (F.10) 

dé 
ma + Hôvq = —ia~!q5dy , (F11) 
q?5¢q = —4r Ga" ôpq . (F.12) 


The solutions of Eqs. (F.10)-(F.12) can be classified according to the 
transformation properties of the dependent variables under three-dimensional 
rotations: 


Vector modes: In these modes, all scalars vanish: not only dq and 5¢q, but 
also q - dvg. Then Eqs. (F.10) and (F.12) are automatically satisfied, while 
Eq. (F.11) becomes 


d5Vq 
dt 


whose solution is simply Vg x 1/a. Because the vector modes simply decay, 
they are widely ignored. 


+ Hévq =0, (F.13) 


Scalar Modes: In these modes, the velocity perturbation dv(X, t) can be 
expressed as the gradient (with respect to x = X/a) of a scalar potential 
perturbation ôu(X, t). For the Fourier transforms, this means that 


ÔVq = İq bug - (F.14) 
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Then, using Eq. (F.12) to eliminate d¢g, Eq. (F.11) becomes 


dôuq “4 4r Ga 
a + Hdug = —a 5¢q = qe ; (F.15) 
while Eq. (F.10) gives 
dê 
a +3Hôpq — a`! 5 q?ôug = 0 . (F16) 


For K = 0 we have 4r Gp = 3H?/2. Using p « a7? and the definition of 
H, we can eliminate dug from these coupled equations and write them as a 
second-order differential equation for the fractional density perturbation 


d ( 2d (Sbq 3 7722 ( Sha 
-H =] =0, F.17 
aalr rA me 
or, recalling that for K = 0 we have a x 17/3 and H = 2/31, 
d (ajd (Seq 2 2/3 ( Pa 
t — -t — ]=0. F.18 
dt ( dt\ p 3 p ( ) 


The general solution of this equation is a linear combination of the pow- 
ers £?⁄ and t—!. It is reasonable to suppose that by the time of last scat- 
tering only the leading mode with 5pq/p « 17/3 œ a will have survived. 
Since the mean density A(t) is proportional to a~*(t), we have dpg(t) « 
a~?(t). Eq. (F.12) then shows that dog is time independent, a result used in 
Section 2.6. 


To verify that these are the most general solutions of Eqs. (F.10)-(F.12), 
we need only count equations and solutions. With d¢@q(¢) eliminated by 
use of Eq. (F.12), Eqs. (F.10) and (F.11) are a set of 1 + 3 = 4 coupled 
first-order differential equations, so they have four linearly independent 
solutions. We have found two independent vector modes (corresponding to 
the two directions perpendicular to q) and two independent scalar modes 
(with 5pq// proportional to ¢7/? or t71), so these are indeed the most general 
solutions. 

The cases of non-zero pressure or K # 0 are treated in detail in 
Section 15.9 of G&C. 
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Appendix G 


Photon Polarization 


The polarization of photons is of great interest in cosmology, not only 
because it can be observed in the cosmic microwave background, but also 
because it affects the anisotropic inertia terms appearing in the gravitational 
field equations for both scalar and tensor perturbations. This appendix will 
review the description of the spin state of an individual photon in terms 
of photon polarization vectors and the related density matrix, and the 
somewhat unusual parallel transport equation that governs the time depen- 
dence of photon polarization vectors and density matrices in gravitational 
fields. 

Let us first recall how we describe the polarization of a photon in the 
absence of gravitational fields. The most general pure state of a single pho- 
ton is a linear superposition ~+ Y4 + a@_W_, where Vx are states of helicity 
+1 (that is, eigenstates of the component of angular momentum in the 
direction # from which the photon is coming! with eigenvalues o = +h, 
and normalization (Wo, Yo’) = 5g’) and œ+ are complex numbers satis- 
fying the normalization condition las|? + Ja_|? = 1. We represent such 
a state by a polarization vector e = a e+ + a_e_ with e* - e = 1, where 
e+ is the polarization vector for photons of helicity +1. For instance, for a 
photon that is seen coming from the direction? 


ñ = (sin@ cos@, sind sing, cos@) , (G.1) 

we can take 
e+ = (Ô + id)/V2, (G.2) 
where 6 and ¢ are orthogonal unit vectors in the plane perpendicular to ñ: 
6 = (cos@ cos ¢, cos@ sing, — sin 0) , 6 =(—sing, cos¢,0). (G3) 


A linearly polarized photon has |a,| = |a_|, and hence a polarization vector 
that is real up to an unimportant over-all phase factor. The opposite case is 
circular polarization, for which either |a,| or |w_| vanishes. Between these 


' This is the opposite of the direction of the photon’s motion, so the helicity, which is defined as the 
component of angular momentum along the direction of the photon’s motion, is the negative of the 
component along the direction from which it is coming. This is why we use a label + to indicate photons 
of helicity +1. 

2For correlations between polarizations in a small patch of sky, one can choose the three-axis to be 
in the direction 7 of this patch, in which case the polarization vectors for helicity =1 could be chosen 
simply as (1, +i, 0)//2. 
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two extremes is elliptic polarization, for which |a,| and |~_| are unequal, 
but neither vanishes. 

An individual photon will in general have probabilities P,, of being in 
any one of various pure states Y, = &m+ Y+ +a —WV_, so according to the 
rules of quantum mechanics, the probability of finding it in a particular state 
Y = a4W4+a_W_ represented by a polarization vector e = a,e++a_e_, 
will be 


Pe) = X Prin. V)? = D> Plat oy + aha- = ed NË (GA) 
m m 
where NÏ is the density matrix for that photon 
NE =) Pmei en - (G.5) 
m 


(Repeated indices 7, j, etc. are summed over the coordinate indices 1,2,3.) 
Because the probabilities are real and positive, the matrix N” is hermitian 
and positive: 


N? = NH, Ng > 0 for all €', Go) 


and because X` Pm = 1 and el eit = 1 (not summed over m), this matrix 
has unit trace 


NË =1. (G.7) 


Ofcourse, the photon polarization vectors are all orthogonal to the photon’s 
direction of motion —/, so also 


NÏ à = NĪ =0, (G.8) 


The scattering of light by non-relativistic electrons does not produce cir- 
cular polarization, and therefore we expect that all microwave background 
photons are linearly polarized, in which case NÏ is real. 

So far, we have defined the polarization vector ef in the absence of 
gravitation as a unit three-vector, transverse to the photon’s direction of 
motion. We can if we like define it in a gravitational field as the spatial 
part of a four-component object e that happens to have e? = 0, and that 
satisfies 


piel = p,e" =0, gje e™ = gue e" = 1, (G.9) 
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where p“ = gp, is the photon four-momentum.’ But since this object 


obviously does not transform as a four-vector there is no reason to think 
that its time dependence would be given by the equation (B.69) of parallel 
transport. Indeed, it cannot evolve by parallel transport, for if it did then 
the condition e? = 0 would not be preserved along the photon trajectory 
x! = x(t) unless it so happened that rx), t) vanishes, which is not usu- 
ally the case. Instead, we expect that in a locally inertial frame of reference 
in which the affine connection pe x(t) ) vanishes, the polarization vector 


of a photon will be time-independent only up to a gauge transformation 
et — e” + ap”, so that gauge-invariant quantities like the field strength 
pte’ — p’e will be time-independent. Then in a general frame of reference, 
a photon polarization vector will undergo parallel transport up to a gauge 
transformation: 


de! (t) dx? (t) Eg dx" (t) 


dt dt dt 


The gauge transformation parameter a(t) can then be determined from the 
condition that e? = 0, so that de? /dt = 0: 


= -Th (x@)er@ (G.10) 


dx*(t) 
dt ` 


a(t) = ri, (x)e" (G.11) 


With e? = 0, Eq. (G.10) then reads 


de/(t) 7 j 0 dx! (t) j dx* (t) 
P = |=", (s) raeo) 7 Jeo zo GD 


Note that the conditions (G.9) are preserved with time, for p, e” and guve” e” 
are preserved by ordinary parallel transport because they are scalars, and are 
preserved under gauge transformations because p,p” = 0 and pe” = 0. 

In a gravitational field we continue to define the statistical matrix NÏ by 
Eq. (G.5), and it continues to satisfy the hermiticity and positivity conditions 
(G.6), but now instead of (G.7) and (G.8) we have 


3We can construct e” in a gravitational field in a general coordinate system x“ from the polarization 
vector Chat of the photon in a locally inertial frame with coordinates £% , as 


oxe . 
u i u 
EN ag! flat + Bp f 


with £ adjusted to make e =0. 
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It follows from Eq. (G.12) that the statistical matrix of a photon moving in 
a gravitational field satisfies the parallel transport equation 


ij 
dN” (t) = |- ri, (x) x RA (x) dx' e] NÄ aw (t) 


dt 
. dx! f À 
+ [rh (x0) +8 (» 0) eo | NZL., 
(G.14) 
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The Relativistic Boltzmann Equation 


In this appendix we derive the Boltzmann equations for neutrinos and 
photons, that govern the evolution of the distribution of these relativistic 
particles in phase space. This will provide the basis for our account in Sec- 
tion 6.1 of the widely used numerical calculations of the evolution of scalar 
perturbations. In addition, these results will be needed in Section 6.6, where 
we evaluate the damping of tensor modes due to the anisotropic inertia of 
free-streaming neutrinos, and in Section 7.4, where we calculate the polariza- 
tion of themicrowave background produced bythe scattering ofan anisotropic 
distribution of photons by non-relativistic electrons. This appendix will first 
consider the simpler case of neutrinos, for which scattering may be neglected, 
and then turn to the more complicated case of photons, for which scattering 
plays an essential role. In an elementary application of the results for pho- 
tons, we will derive a formula for the rate of damping of acoustic waves in a 
medium of photons and charged particles, used in Section 6.4. 

Throughout this appendix we shall adopt a coordinate system for which 


go=—-l, gn =0, (H.1) 


while g; (x, t) is unconstrained. In the linear approximation, this form of the 
metric is automatic for tensor modes, and follows if we work in synchronous 
gauge for scalar modes. However, we will not specialize to the case of 
linear perturbations until later. With this metric, the only non-vanishing 
components of the affine connection are 


ee Li (= poi r) (H.2) 


Y 2 axl ax! əx! 
oe ee 1. 
r= 58 Èri , ry = zï. (H.3) 


1 Neutrinos 


It will be convenient to work with a neutrino distribution function 1, (x, p, t), 
defined by 


3 3 
nxp => (I B(x! — so) (11 5 (pi - oto) , HE4 


r i=1 i=1 


with r labeling trajectories of individual neutrinos (or antineutrinos). This 
expression has the defining property of a number density, that the integral 
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ofn, over any volume of phase space equals the number of neutrinos in that 
volume. According to Section 4 of Appendix B, the momentum variable in 
Eq. (H.4) is defined by pr; = gy; p}, where p“ is the momentum four-vector 


pi = dx! Jdu, . (H.5) 


where u, is a suitably normalized affine parameter for which the spacetime 
trajectory satisfies Eq. (B.12): 


d?x? dx; dx 
J2 +T} (xr) En =0. (H.6) 


Then between collisions the rate of change of the momentum is simply 
given by 


: 1 jk OL jk 
; = — - ; H.7 
Pri peor ee ( Axi ao ( ) 


while the rate of change of the coordinate is 
xi = pi/py (H.8) 
It follows then directly from Eqs. (H.4), (H.7), and (H.8) that in the absence 
of collisions, n, satisfies a Boltzmann equation 
ony rs ony z any pip“ O8ik Jie (1.9) 
dt dx! p® — ap; 2p® ax! 


It should be understood that p’ and p° are expressed here in terms of the 
d : à : 1/2 
independent variable p; by p’ = g” (x, t)p; and p= G (x, Pipi) , SO 
they depend on position and time as well as on pj. 


We now specialize to the case of a small perturbation. The spatial metric 
is then of the form 


gj X, t) = a? (t)ôij + gijx, t) , (H.10) 


with 5g, small. With only a small perturbation to the metric, the neutrino 
distribution function never gets very different from its equilibrium form, so 


we write 

nyt) = h (aO ygi, Opipj) +m, (LAL) 
where 

Stee oll : = 

fn) = Os [exp (pikea?) Es 1] (H.12) 
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and ôn, isa small perturbation. (Note that the factor a(t) in the argument of 
nin Eq. (H.11) is canceled by the factor a(t) multiplying T(t) in Eq. (H.12); 
this factor has been included in Eqs. (H.11) and (H.12) because at times 
of interest, when T < 10!°K, we have T(t) « 1/a(t), so that Eq. (H.12) 
defines n, as a time-independent function of its argument.) The first term 
in Eq. (H.11) is the neutrino distribution we would expect according to the 
Principle of Equivalence in a perturbed gravitational field, if the distribution 
in locally inertial frames were just the equilibrium distribution 7,(p); the 
second term 6n, thus represents the departure of the neutrino distribution 
from its equilibrium form. 

We can derive an initial condition for ên, by noting that, at a time tı 
corresponding to a temperature 7; ~ 10° K, the neutrino scattering rate had 
already dropped well below the expansion rate, but there had not yet been 
time for the perturbations in the metric to distort the neutrino distribution 
away from /ocal thermal equilibrium, but with a perturbed temperature. 
Thus as a convenient initial condition we may take 


bT(X,t1) Pk- bug (x, 2] 
T(t) a(tı) í 


ôn, (xX, p, t1) = —pi',(p) | (H.13) 


where T (t1) ~ 10°K. The second term in square brackets represents the 
Doppler shift due to a possible neutrino streaming velocity ôu, analogous 
to that given for microwave background photons by Eqs. (2.4.5) and (2.4.6). 
In local thermal equilibrium this is the same velocity perturbation as for 
baryons and photons. (Note that the metric position vector is ax“, so the 
metric velocity vector is au% = a~!ug.) 

To first order in metric and density perturbations, Eq. (H.9) reads 


_ 0dny (Xx, p, t) n ðn, (X, p, Í) Pi 

J ðt ax! a(t)p 

ny (p) 3 
2p ot 


a(t), (p) 5g" (x, t) 
+ 2p? axk PiPjPk 


0 


+ 


(ese! (x, O )pipj 


Pipjpk 95gij(X, t) 
2a@(t)p? axk 


+, (Pp) i 
where here p = „/Pipi. (The third from last and penultimate terms on 
the right-hand side arise from the dependence of the argument of n, in 
Eq. (H.11) on ż and x through the combination a?(t)g!/ (x, t)pip;.) To first 
order, we have g/ = a`? Sij — a48 gij, so the penultimate and last terms on 
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the right-hand side cancel, leaving us with the much simpler result 


ddny(X,p,t) pi 3n, (x, p, t) _ PHCP), , ô 
at a(t) ax 2 Piz; 


(assy) , 
(H.14) 


where again p = ./pxpx, and as usual we write p; = p;/p. 


2 Photons 


The Boltzmann equation for photons is considerably more complicated than 
for neutrinos, because of the necessity of taking photon scattering into 
account. Since scattering can change the polarization of photons, we can 
no longer write a separate Boltzmann equation for each helicity state of 
photons, as we did for neutrinos. Instead, we now define a number density 
matrix: 


nil (x,p.0) = > (I (xk — xf H)) (i (=ru) 10. 


r k=1 
(H.15) 


with r here labeling trajectories of individual photons, and N7 (£) the 
polarization density matrix of the rth photon. As discussed in Appendix 
G, if an individual photon can have any one of several polarization vectors 
a with probabilities P,,, then it has a polarization density matrix 


NÏ = =D Pnn el. (H.16) 


If we observe whether the polarization of a photon with polarization 
density matrix NÏ is ina particular direction e! rather than in an orthogonal 
direction, we find a probability g,gye*/e/N™. Recall that for a photon of 
three-momentum p; in a general three-metric gjj, the photon polarization 
vectors are defined so that 


pie =0, giele!* =1 (H.17) 
so the polarization density matrix of photon r satisfies 
PONO =P ONÄO=0, g(x) NËO=1, H18) 
and the number density matrix correspondingly satisfies 
pink (x, p, ) = pin x, p, N) =0, gj, n(x, p, ) =n; (% pd), 
(H.19) 
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where n, (x, p, £) is the phase space number density of photons, defined just 
asin Eq. (H.4). By their definition, both N/ (t) and n? Ix, p, £) are Hermitian 
matrices. 

As discussed in Appendix G, between collisions the time dependence of 
the polarization vectors is given by a combination of paras transport with 
a gauge transformation that keeps the time component of ey (t) vanishing: 


j . Í 5 <a 
a -r4 (x01) +73, (x0, 1) E a do | mw 


dt dt dt 


and consequently the time dependence of the density matrix of the rth pho- 
ton is 


GNP @) [u j dxi (t) 0 
L = | -ria (10.1) + rR (x01) | wh Sa) 


j J 
+ ri (x0. r) ET? (x0, 1) se nike ee ACA) 


The time dependence of the variables p,;(t) and x(t) is given by the same 
equations (H.7) and (H.8) as for neutrinos. It follows then directly from 
Eqs. (H.15), (H.21), (H.7) and (H.8) that ny (x, p, t) satisfies a Boltzmann 
equation: 


. (H.21) 


an! ðnË p k ant) p'p ™ Onn 


dt = axk p? Əpk 2p? axk 
A 0o \P k 
+(ri, rh) y 
r/ P! 0 p ik _ Ci H.22 
+ ka 0° kaj poy TS >? (H.22) 


where CÏ is a term representing the effect of photon scattering. (As a 
reminder, we note that in Ee (H.22) p’ and p° are functions of x, t, and the 
pj, given by p’ = g"/p; and p° = [g"pip;]'/.) 

To evaluate the collision term CË in Eq. (H.22), let us first consider the 
case of flat spacetime, where gj; = ôij. Since the Thomson scattering of an 
unpolarized or linearly polarized photon can only produce linearly polar- 
ized photons, we will limit ourselves here to the case of linear polarization, 
for which polarization vectors can be taken to be real, and the polarization 
density matrix and number density matrices are real and symmetric. We will 
first consider the case of electrons at rest, and later (in dealing with scalar 
perturbations) take up the case of a non-zero plasma velocity. 
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The scattering of photons of momentum p into some other direction 
causes a decrease in n} (x, p, £) which is simply given by a term in C}: 


Cl (x,p.t) = —oe(ÐnË (x, p, t) , (H.23) 


where w-(t) is the total collision rate. (We hold here to the convention of 
writing n% and CË with upper indices, though of course in flat spacetime 
there is no distinction between contravariant and covariant spatial tensors.) 
There is also an increase in ny (x, p, t) caused by the scattering of photons 
with some initial momentum p; into momentum p, with energy conservation 
requiring (assuming that |p| < me) that |p;| = |p|. The Klein—Nishina for- 
mula! tells us that when a photon with momentum pj; and real polarization 
vector ei is scattered by an electron at rest, the probability of finding a 
photon in the final state with real polarization vector e’ is proportional to 
(e-e)*, with no dependence on the initial photon momentum pj; or the final 
photon momentum p;, except of course that ei and e! must be orthogonal to 
pii and p;, respectively. If the initial photon can have various polarizations 
el, with probabilities P,, then correspondingly the probability of finding 
the photon in the final state with polarization e! is proportional to e'e! NY is 


where NË = $P ne nen As already mentioned, this probability must 
equal e igi NÏ, where NË is the polarization density matrix of the photon in 
the final saie Together with the conditions that p;NËÏ = 0 and NË = 1, 
this tells us that the scattered photon has polarization density matrix 


ijy A PE j oan kj aa i aaa a 
N4(p) = S| NË — DPN — PiPeNE + biPibePN |, 2 
where x 
S(p) = 1—pipjN/ . (H.25) 


The rate of increase in nË (x, p, 7) due to scattering of photons from an 
arbitrary direction p; into direction p is then 


2o 


d*p 
x S'(p)| n(x, [pl D — Bide nl (x, pli. 


C! (x, p.t nan [eis 


— Pybi n” (x, plô, D + PiPjPxbi nt (x. |plp1, | .  (H.26) 


where ne is the electron density, and do/d7f is the differential scattering 
cross section. Summing (e - e1)? over final polarizauong gives a differential 
cross section proportional to S(p), and f dP S(p) = 82/3, so equating 


'See, e.g., QTF, Vol. I, Sec. 8.7. 
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f d*p (d?o /d?P) to the Thomson scattering cross section oz, we have 


do 30r 
=~ = —_S(p). H.27 
2p Bn (p) (H.27) 


Then Eqs. (H.23), (H.26), and (H.27) give a net change in nË (x, p, t) 
equal to 


CÏ, p, ) = Cx, p, ) + Cl, p.d 


= -onip ) + R fap, 


8r 
x [nbcx, Iplĝ1, Ð — Pip nid (x, IplP1.0 — PiPr nik (x, Iplp1.0) 
+PiPjPxbi nk! (x, plô, | (H.28) 


in which we use w,(t) = ne(t)or. 

As a check, it is easy to see that the collision term (H.28) vanishes in 
local thermal equilibrium. In equilibrium photons are unpolarized, so n} « 
dij — pip;, and their momentum distribution is homogeneous and isotropic, 


so the coefficient of proportionality f depends only on |p| and t: 


nip, D =F (pl, D| — Pid] - 
Then 
Qn ij A 870 
dP 1Meq(X, [PIP ) = Sf (pl 54 - 


Using this in Eq. (H.28) shows that in equilibrium the two terms in CY 
cancel, as of course they must. 

To find the result for an arbitrary three-metric gj;, we must simply write 
Eq. (H.28) in a form that is invariant under general three-dimensional 
coordinate transformations, and reduces to Eq. (H.28) in the case of a flat 
three-metric with g; = 5;;. In this way, we find 


CÌ (x, p, t) = —@(t)n'! (x, p, t) 
RECIO) d>p,./Detg(x, t 

8x p’?(x, p, t) 
gE, DPkPI ij 


x [ajo pi t) — eng ” (x, pi, 4) 


5 (P(x, p, © — p(x, pi, 1) 


ik 
g (x, OpPKpe i 
zS NED 
pP“, p, t) 


ik il 
„Dg X,t H 
g(x = (X, 1) PKPIPmPn n(x, Pi. Ðl, ( .29) 
p(x, p, t) 
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P(x, p, © = J gÏ x, Opip; . (H.30) 


We now again specialize to the case of a small perturbation, writing the 
metric as in Eq. (H.10), and putting the photon number density matrix in a 
form analogous to Eq. (H.11): 


where 


ik jl 
n'i(x,p, t) = b a(t)p°(x,p,t)) | gË, t) — g x D8” & Oppi 
2 


p’? (x, p, À 
+ bn} (x, p, t) , (H.31) 


where 7,(p) is the equilibrium phase space number density, a time- 
independent function of its argument 


aD = os [ee (pror) — 1]. 3D 


and ôn? is a small perturbation. Fortunately, to first order in perturbations 
the Boltzmann equation (H.22) is greatly simplified by the fact that the 
quantity in square brackets in Eq. (H.31) satisfies a collisionless Boltzmann 


equation 
9 k 9 lym g ə . ini 
o= [2 2 PP" 38m pate 
ət =p’ axk 2p? axk apk p? 


a i kpj 

P ( i P 2) kj PP 
+— (TT), aT gI —-— 

p? \ kA ph d 


X , j ink 
p AE E 
u (ri, - are] Ẹ - r te) 


P 


(This can be proved directly, or more easily by noting that if we set p; = pri (t) 
and x! = xİ(t), then the quantity in square brackets is the sum over two 
orthogonal polarizations of the quantity el (the; (t), and therefore satisfies 
Eq. (H.33) as a consequence of Eq. (H.20). Since Eq. (H.33) holds in this 
sense for any photon trajectory, it is necessary for it to hold for arbitrary x! 
and p;.) Acting on the factor n, (ap), the derivative operators in Eq. (H.22) 
give to first order 


a k 9 lmg 3 
+P ae a sim 
ot pY ax 2p’ əx" dp 


1 
ity (ap?) = —5VPiPi 
xin’ Pip Pir (a ~*5gu1) ; (H.34) 
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with the effect of the position and momentum derivative terms in parenthe- 
ses on the left-hand side canceling. There is another simplification provided 
by the fact that the the first term in Eq. (H.31) does not contribute to the 
collision term, since even in a gravitational field collisions by themselves 
do not alter equilibrium distribution functions. Keeping only terms of first 
order in perturbations, the Boltzmann equation (H.22) then becomes 


IEn , pe DdNE(XP.1) | 2a(t) 


én! (x, p, t 

at a(t) ax a ny (X, p, t) 
Leh eee es o 
aan” (Dpxbr— (a Og (X, D) Q -— bij) 


ij Bwe t) x 
= —we(t) ôn} (x, p, t) + sd [em 


x dnl (x, pôi, D — Bide Suk’ (x, pôi, D — Bide Sn¥ (x, pôi, D) 
+biPiPupr 5nk! (x, ppa D] (H.35) 


where again p = ,/pip;, and p = p/p. As a check, note that Eq. (H.35) is 
consistent with the condition p;én!/ = 0. As a further check, note that the 
first-order perturbation to the photon phase space number density gyn? is 
a’én", and according to the trace of Eq. (H.35), the Boltzmann equation 
for a*dnit is the same as the Boltzmann equation (H.14) for the perturba- 
tion to the neutrino phase space density, aside from the presence of the 
collision term and the appearance of 7, instead of 7. 

For tensor modes, Eq. (H.35) can be used as it stands, but in dealing with 
scalar (or vector) modes, we need to take up a complication that has been 
ignored until now: in general the plasma has a small velocity u(x, £) (which 
in Section 6.1 et seg. is denoted dug(x,1)). Since ôu is itself a first-order 
perturbation, in calculating its effect on the collision term CË, to first order 
we can ignore any perturbations to the gravitational field or the photon 
number density matrix. The effect of the plasma velocity is to shift the 
energy |pi| of the incident photon that when scattered yields a final photon 
with momentum p. To calculate this effect, let’s first consider the case of 
flat spacetime. Since photon energy is conserved in the rest frame of the 
plasma, in which the plasma metric four-velocity is simply v” = (0,0, 0, 1), 
we have (pı — p)- v = 0; evaluating this scalar in the “lab” frame in which 
the plasma velocity four-vector is v“ = [êv, y 1 — (8v)?] we have 


0 = va (pi — P)! = dv - (Pı — p) — V1 — (v)2(p? — p°) , 
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or, to first order in velocity, 
IP11 = Ipi[ 1 + (1 =A) +v] . 


As already noted, the flat-space velocity components v* are related to the 
velocity components in a Robertson—Walker three-metric gj, = a sii by 
ve = au“ = aug, so 
(Pi — P)kôUk 
pil = pi [1 + AE], (H.36) 
As a result, the two terms in the collision term (H.28) do not cancel even 
when we set the three-metric gj equal to a75; and set the photon number 
density matrix equal to the form it would have in equilibrium with this 
three-metric: 
nË, eq (x, p, t) = 


zag” D [s — ĵiĝ;| - 


Since in this case these two terms in C Ü would cancel if |p1| were equal to |pl, 
we find a new term in C(x, p, t), equal to the difference between the term 


—we(t)n! eg (x, p, t) and the same term with |p| replaced with |p1|, averaged 
over p,. The linearized Boltzmann equation (H.35) now becomes 


ðt a(t) ‘aot a(t) 


pit, OEA (a? (dgu(x, D) (51 = Pihi) 


bn} (x, p, 0) 


1 
4a? (t) 
= -oe nia, p d + [arp 
8x 


x [snl pi. t) — PiPk ôn (x, pôi, t) — PiPK ôn” (x, PPL, t) 
+DiPiPkbr 6nk! (x, pp1, n] 


(pxõur(x, D) ii, (p) 5, a Pi] l H37) 


(The term in Eq. (H.36) proportional to p1;,du; gives no contribution to the 
integral over p1.) 


x k k 


In an important application of Eq. (H.37), we may derive the decay rate 
of acoustic waves in a homogeneous and time-independent plasma in flat 
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spacetime (with a = 1 and wc time-independent) when the collision rate is 
much larger than the sound frequency. (This result will be used in Section 
6.4.) We seek a solution of the Boltzmann equation (H.37) of the form 


bn (x, pt) = eX elt Sn (p) , duj(x,t) = gre lsu: : (H.38) 


Then with a = 1 and œc constant, Eq. (H.37) becomes 


[oc — io + if- k| ônË (p) 


Bwe 
= dp 
87 Pl 
x [ani (pP1) — PiP Suh! (P1) — PjPx 8n” (PP1) + PiPjPdi 8n! (.)] 
w = AOA 
-= (p: su) apès — pid] - (H.39) 


We now need a formula for the plasma velocity du;. For this purpose, we 
note that in flat space the first-order photon and baryonic plasma 
contributions to the energy-momentum tensor have 


ôT} = J Bp an) plpibj, 6T) = J dp a?ntk (p) |p|; , (H.40) 
and 
Tb; =0, 8TH; = bpdu;, (H.41) 
so the equation of momentum conservation may be written 
w ppouj = J dp snk (p) p; (k oe o) . (H.42) 


To evaluate the right-hand side, multiply Eq. (H.39) with p; and integrate 
over p; this gives 


A Awe. - 
i f Pp œp- opono) = owe f adp pant) + swa, , 


where py = f Ëp pñ, (p) = —(1/4) f dpp’ n,(p). Using this in Eq. (H.42) 
gives our formula for ôu: 


M 4iwe _ , 
(e B + Hae, ôuj = iwc [or pini (p) f (H.43) 


For scalar modes, k is the only parameter in the problem with a sense 
of direction, so we can express the integrals appearing in Eqs. (H.39) and 
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(H.43) as 
J ew ônË (p) = py [xs + ykik;| : (H.44) 
J d’p dni! (p) pj = Py Zk; , (H.45) 


where X, Y, and Z are here functions of k = |k| and w. Eq. (H.43) then 
lets us express ôu in terms of Z: 


3Z a 


su = ——"— —_& 
coe’) RT 


(H.46) 


where R = 30g/4(,, and te = 1/we is the mean time between collisions. 
Using Eqs. (H.44) and (H.46) in Eq. (H.39) then yields a formula for the 
perturbed intensity: 


e 35 Z(ĝ-k a 
An J ôn? (pp)prdp = a [( + zee) (èy - pip) 


+Y (Ri — PeP $) (Èi -DG i)| / (47) 


By inserting this back in the definitions (H.44) and (H.45), we find three 
homogeneous linear relations among X, Y, and Z. For these to be 
consistent, the determinant of the coefficients must vanish, which yields 
a relation between œ and k = |k]. 

The resulting dispersion relation is quite complicated, but it becomes 
much simpler in the case of small mean free time te. As an ansatz, we can 
try taking Y/X of first order in te, while Z/X is of order unity, leaving it for 
later to check whether this leads to a consistent solution. To second order 
in te, the terms in Eq. (H.44) proportional to 6; and kik; and the coefficient 


of k; in Eq. (H.45) yield the homogeneous linear relations 


, 2 1 ; 
0= (ior -ot — een) X+ ol + iwt,) Y 


2 
$ =( — ikte + wkt2(2+R)) Z (H.48) 
tas 3 Ti 
= _K272x ia pede MY 
0 5K be +( at gore) Y 
2+R 
+ (ir a : eok) Z (H.49) 
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0= ( — ikte + 2wkt?) X — zikte Y 
3 

Ma (roa + R) — œt? (1 + R+ R?) — se) Z. (H.50) 
(As anticipated, these relations are consistent with the assumption that Y 
is of order te relative to X and Z.) To first order in te, the vanishing of the 

determinant of the coefficients in these relations yields the condition 

0 = 1501 + Ro? — 5k? + ioòte| — 5 — SR + 15R?°| + Tiok te. 


This has the solution, again to first order in te: 


k 


= +———— -il, H.51 
J3 + R) ved 
where T is the decay rate 
kte |16 R 
= ; H.52 
gin ett aa 


This is the same as the formula (6.4.25) for the damping rate given in Section 
6.4, originally derived by Kaiser.? As discussed there, it is equivalent to the 
formulas (6.4.24) for the coefficients of shear viscosity and heat conduction 
ofa non-relativistic plasma in which momentum and energy are transported 
by photons. 


2N. Kaiser, Mon. Not. Roy. Astron. Soc. 202, 1169 (1983). 
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Glossary of Symbols 


(These are symbols used in more than one section. Numbers following the 
symbol give the section in which the symbol is first used.) 


a 1.1 scale factor in Robertson—Walker metric 
ag 2.1 radiation energy constant 
dem 2.6 temperature partial wave amplitude (same as aT em) 
A 5.1 one of scalar perturbations to metric 
B 3.3 baryon number 
5.1 one of scalar perturbations to metric 
6.1 subscript denoting baryonic plasma 
C; 5.1 one of vector perturbations to metric 
Ce 2.6 temperature multipole coefficient (same as Crr.e) 
d 1.2 proper distance 
d4 1.4 angular diameter distance 
dy 2.6 horizon distance 
dg 1.4 luminosity distance 
D 6.1 subscript denoting cold dark matter 
D© 7.4 representative of the rotation group for spin £ 
Dj 5.1 tensor perturbation to metric 
Dı 5.2 gravitational wave amplitude 
D? 6.6 constant gravitational wave amplitude outside horizon 
ej 5.2 graviton polarization tensor 
E 5.1 one of scalar perturbations to metric 
EQ 6.2 subscript indicating the time of matter—radiation equality 
F 5.1 one of scalar perturbations to metric 
6.6 function appearing in tensor anisotropic inertia 
7.1 form factor in scalar temperature fluctuation 
g 3.1 multiplicity of states 
Suv 1.1 spacetime metric 
gj 1.1 spatial part of Robertson—Walker metric 
G 1.5 Newton’s gravitational constant 
7.1 form factor in scalar temperature fluctuation 
G; 5.1 one of vector perturbations to metric 
h 1.3 Hubble constant in units of 100 km sec~! Mpe~! 
2.44 Planck’s constant = 27h 
hyy 5.1 perturbation to the metric 
H 2.2 expansion rate a(t)/a(t) 
Ho 1.2 Hubble constant a(to)/a(to) 
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Glossary of Symbols 


Hankel functions 

action 

contribution of integrated Sachs—Wolfe effect 
spherical Bessel function 

dimensionless neutrino intensity 

current four-vector 

dimensionless photon intensity matrix 

physical wave number at present = q/ao 
Boltzmann constant 

curvature constant in Robertson—Walker metric 
function appearing in tensor anisotropic inertia 
apparent luminosity 

multipole order 

absolute luminosity 

subscript indicating time of last scattering 
lepton number 

apparent magnitude 

particle mass 

absolute magnitude 

total mass 

million parsecs 

number density 

photon number density matrix 

slope parameter in Rå 

coefficient of g~*/* in R? 

effective number of particle types 

number of e-foldings in inflation 

pressure 

parsec 

Legendre polynomials 

power spectral function of density fluctuations 
power spectral function of gravitational potential 
co-moving wave number vector 

deceleration parameter 

radial coordinate in Robertson—Walker metric 
tensor/scalar ratio 

3/4 the ratio of baryon to photon energy density 
Ricci tensor 

curvature perturbation 

constant curvature perturbation outside horizon 
proper length 

entropy density 
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S 71 
Suv 1.5 
SA) 7.4 
S 65 
SW 2.6 
t 1.1 
to 1.5 
ti 6.1 
T 21 

7.1 
Tyy 1.1 
T 65 
u 1.1 

5.1 
Uy 5.1 
U 6.6 
U 2.6 
V 1.12 
w 1.1 
X 1.1 
X 23 
X 19 
y 63 
Y; 2.6 
V 14 
Z 1.2 

10.1 
a 5.2 

9.1 
B 5.2 

6.4 

9.1 
Be 9.1 
y 6.1 

6.4 
r 64 
Pie Ast 
Tik 1.1 
ôy L1 
ôyq 6.2 
A 5.3 


Glossary of Symbols 


superscript indicating scalar mode 

source term in Einstein field equation 
standard rotation from three-axis to direction ñ 
a scalar transfer function 

contribution of Sachs—Wolfe effect 

time 

present age of universe 

initial time in solution of Boltzmann equation 
temperature 

superscript indicating tensor mode 
energy-momentum tensor 

the main scalar transfer function 

affine parameter 

velocity potential 

velocity four-vector 

a tensor transfer function 

sound speed 

scalar field potential 

ratio of pressure to density 

co-moving spatial coordinate 

fractional ionization 

Newtonian spatial coordinate 

ratio of matter and radiation densities= a/ago 
ordinary spherical harmonics 

spin +2 spherical harmonics 

redshift 

quantity ag/H in Mukhanov-Sasaki equation 


stochastic parameter of scalar modes 

angle between gravitational lens and true source positions 
stochastic parameter of tensor modes 

ratio of baryon and total matter densities 

angle between gravitational lens and apparent source positions 
angular radius of Einstein ring 

subscript denoting photons 

Euler constant 

acoustic decay rate 

affine connection 

spatial affine connection for Robertson—Walker metric 
Kronecker delta 

ôpxq/(Px +px) (for X = B,D,y,v) 

change due to gauge transformation 
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6.5 
A® 6.1 
AONE A 
AP 6.6 
AD 6.6 
€ 10.1 
co ,€5.3 
¢ 54 
n 3.2 
Nuv 1.5 
0 1.1 
K 6.2 
u 3l 
v 1.2 
6.1 
a 6.6 
Tij 1.5 
xS 1.5 
xY 1.5 
mi 1.5 
m 6.1 
p 1.1 
o 1.10 
2.2 
8.1 
T 1.1 
1.10 
5.3 
10.1 
@ 1l 
2.6 
gy 1.12 
® 5.3 
6.1 
w 53 
y 53 
6.6 
Qg 1.5 
Qu 1.5 
Qr 1.5 
Qa 1.5 


Glossary of Symbols 


a scalar transfer function 

scalar photon temperature amplitude 
scalar photon polarization amplitude 
tensor photon temperature amplitude 


tensor photon polarization amplitude 

-H /H? 

gauge transformation parameters 

perturbation conserved outside horizon 
baryon/photon ratio 

Minkowski spacetime metric 

polar angle (except in Sec. 9.5) 

dimensionless rescaled wave number 

chemical potential 

frequency 

subscript denoting neutrinos & antineutrinos 

a tensor transfer function 

total anisotropic inertia tensor 

scalar anisotropic inertia 

vector anisotropic inertia 

tensor anisotropic inertia 

a scalar source function 

energy density 

cross section 

entropy per baryon 

standard deviation of density perturbations 
proper time 

optical depth 

parameter for transformation between synchronous gauges 
conformal time 

azimuthal angle (except in Sec. 9.5) 
gravitational potential 

scalar field 

a scalar metric perturbation in Newtonian gauge 
a scalar source function 

a scalar metric perturbation in synchronous gauge 
a scalar metric perturbation in Newtonian gauge 
tensor source function 

-K/a H? 

ratio of matter density to critical density 

ratio of radiation density to critical density 

ratio of vacuum density to critical density 
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Assorted Problems 


. Consider a universe described by a Robertson—Walker metric with 
K = +1. Give a transformation of co-moving space coordinates that 
leaves the metric unchanged, and that takes the point x = (0, 0,7) into 
a point x = (0,0,7’), with no change in the time. (Hint: Consider 
the three-dimensional space as the surface of a four-dimensional ball, 
construct this transformation as a rotation in four dimensions, and 
then express it in terms of the Robertson—Walker coordinates.) Also 
give the corresponding transformation for K = —1. 


. Suppose that the total pressure and energy density of the universe are 
related by p = —p+/7/ 1, where pı isa constant. Assume zero spatial 
curvature. Calculate o as a function of the Robertson—Walker scale 
factor a, taking a = a; when p = oo. Calculate a and p as functions 
of time, taking ¢ = 0 as the time when pọ = oo. Calculate the age 
of the universe and the deceleration parameter go for a given present 
density pọ. 


. Consider the empty cosmology, with Qm = Qr = Qa = 0. Calculate 
the luminosity distance and angular diameter distance as functions of 
redshift in this cosmological model. What is the age of the universe as 
a function of the present Hubble constant? 


. Suppose that astronomers measure the age of a galaxy with redshift 
z = 2.5. How old would this galaxy have to be (at the time the light 
from it was emitted) in order to rule out the hypothesis that Qm = 1 
with negligible vacuum and radiation energy density. Use Hp = 70 
km/sec/Mpc. 


. Suppose that Qm = 0.25 and Qy = 0.75, with Qr negligible. What is 
the redshift at which the expansion of the universe stopped decelerating 
and began to accelerate? 


. Suppose that the gravitational potential energy of any pair of galaxies 
with separation r decreases as r~” instead of r~!. What combination 
of the mass of a virialized cluster of galaxies and the Hubble constant 
could be calculated from measurements of angular separations and 
velocity dispersions of its individual galaxies? 


. Suppose that the fluctuations in the temperature of the cosmic 
microwave background are governed by a Gaussian isotropic 
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Assorted Problems 


translation-invariant probability distribution. Calculate the quantity 


(co = c) hc 


as a function of £. 


. Suppose that there is some new massless elementary particle that 


effectively decouples from everything else at a temperature 10!° K. 
Suppose that the particles with mass less (and in fact much less) than 
1 TeV are those of the minimal Standard Model: three generations of 
quarks, leptons, antiquarks, and antileptons with helicity +1/2; the 
W= and Z? with helicity +1 and 0; the photon and eight massless 
gluons with helicity +1; and a single neutral boson of helicity zero; 
together with their antiparticles (where distinct from the particles). 
What is the ratio at present of the temperature of the new massless 
particles and the temperature of photons? 


. Suppose that the binding energy of deuterium were 5% less. Estimate 


the effect that this would have on the abundance of Het produced in 
the early universe. 


. Suppose that instead of B — L, only the combination 3B — L of 


baryon number and lepton number were conserved, along with the 
usual conserved quantities of the Standard Model. Taking account of 
the particles of the minimal (non-supersymmetric) Standard Model, 
what is the density of baryon number and lepton number in thermal 
equilibrium for a given density of 3B — L? 


. Suppose that the Robertson—Walker scale factor increases during 


inflation by a factor e*°; that the scale factor at the beginning of 
the radiation-dominated era was the same as at the end of inflation; 
that the energy density at the end of inflation was [2 x 10!6GeV]f; 
and that the energy density at the beginning of the radiation-dominated 
era was [2 x 10!°GeV]*. Over what range of angles would you expect 
the cosmic microwave background now to have a nearly uniform 
temperature? 


. Use the “slow-roll” approximation to calculate the inflaton field (t) 


as a function of time for a potential V (p) = gg”, where g and n are 
positive constants. 


. Find the two independent solutions of the field equations and conser- 


vation equations for scalar perturbations far outside the horizon in 
Newtonian gauge in the case of a perfect fluid having p = wp, with 
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Assorted Problems 


w constant. Use the results to calculate R = —W + H du for each 
solution. 


. Consider the equations for scalar fluctuations in Newtonian gauge in 


a universe containing only cold dark matter. Find the two solutions 
of the coupled equations for ®, and dug. (You can normalize these 
solutions any way you like, but keep it simple.) For each solution, 
calculate R4, dog, and ¢,. Do not make any assumption about the 
magnitude of the wave number. 


. Consider a vector field 


VE (x,t) = VY (t) +5V"(x, 2), 


where the unperturbed space components VÝ are zero, and the 
unperturbed time component V° is some non-zero function only of 
time. What is the change in 6V° and 6V' induced by a general gauge 
transformation? 


. Re-do the analysis of Section 5.2, but for vector fluctuations. 


. Suppose that the differential scattering cross section of neutrinos on all 


other particles were a constant, independent of direction or energy. (It 
isn’t.) For a given total neutrino collision rate w,(t), derive the Boltz- 
mann equation that would be satisfied by the neutrino phase space 
distribution function 7, (x, p, £) in a spacetime metric with goo = —1, 
gio = 0, and gj(x, t) arbitrary. For the case where g; (x, t) = a’ (t) dj + 
ôgij (x, t), derive the Boltzmann equation for the phase space density 
perturbation ôn,(x, t) defined by Eq. (6.1.39), to first order in ôg; 
and ôn. 


. Suppose that Ho were doubled, and Qg and Qy reduced by a factor 


four. What would be the qualitative effect on the plot of €7C; vs. £ for 
£17 


. Assume that Cr, , 1s dominated by a co-moving wave number at last 


scattering equal to £/rz, where rz is the co-moving radial coordinate 
of the surface of last scattering. Also, neglect the Doppler effect, and 
use S = 5 and 7 = 1 for the transfer functions. With these assump- 
tions, give an approximate formula for the ratio of the heights of the 
first and second acoustic peaks in Chr. 7 


What value of the multipole order £ corresponds to a co-moving 
wavelength equal to the diameter of a co-moving sphere that encloses 
the total mass 10!5 Mọ (including cold dark matter) of a cluster of 
galaxies? 
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22. 


23. 


24. 


29: 


26. 


Assorted Problems 


Define a mean square matter density fluctuation as 


v= (mf? xaman) } 
2 


where the integral extends over all space. At what time does op 
become equal to 0.1 for R equal to the diameter of a co-moving sphere 
that encloses the total mass 10!? Mo (including cold dark matter) of 
a large galaxy? Assume a Harrison—Zel’dovich primordial spectrum 
with |N| = 2 x 107°. 


Derive a formula for the multipole coefficients in the correlation 
between fluctuations in the cosmic microwave background temper- 
ature in a direction 7 and the cosmic mass density fluctuation at a 
redshift z and direction 7’. Use any approximation that you think is 
reasonable, but make it clear what approximations you are using. 


What is the effect of a general infinitesimal rotation on the polariza- 
tion tensor ej(#, +2) of a gravitational wave travelling in a direction n 
along the 1-axis? 


Suppose that the probability distribution of tensor fluctuations 
respected the symmetries of translation and rotation invariance, but 
not invariance under space inversion. What would that imply for the 
average (B(q,A)6*(q’,4’)), where B(q, A) is the stochastic parameter 
for wave number q and helicity 4? Give a formula for the multipole 
coefficient Cfp , in this case. 


Suppose that light passing a galaxy is deflected toward the center of 
the galaxy by an angle A/b”, where A and n are constants, and b is 
the distance of closest approach of the light to the galaxy center. Find 
the lens equation, i.e. a relation between the observed angle 6 between 
the lines of sight from the earth to the lensing galaxy and a more dis- 
tant source of light and the angle a that there would be between these 
lines of sight if light were not deflected by the galaxy. Give a formula 
for the apparent luminosity of the image of the source in terms of £. 


Define multipole coefficients Cy g,e by 
(ay ema wm) = See mm Cy Ee » 


where ay em and ag em are the coefficients in the partial wave expansion 
of the shear and the Stokes parameters, defined in Sections 9.5 and 
7.4. Give a formula for the correlation functions (y1 (A) O(n’)), 
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28. 


Assorted Problems 


mA OAD, (MAAUA, (2A) UR)) of the shear and Stokes 
parameters in terms of these multiipole coefficients. 


Assume that inflation is driven by a single inflation field gy, with a 
potential V (p) = gexp(—Ag’). What are the conditions on à and 
ọ to allow the use of the slow-roll approximation? Assuming these 
conditions to be satisfied, give values for the scalar and tensor slope 
parameters ns and nr. By how many e-foldings does the universe 
expand when ọ changes by a factor 100? 


In multi-field inflation, what is the dependence of scalar field pert- 
urbations on wave number outside the horizon when you take account 
of a slight variation of the potential during the era of horizon exit? 
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